
S I G N AT U R E 
IN THE CELL 

D N A EVIDENCE FOR 
INTELLIGENT DESIGN 



STEPHEN C. MEYER 



Signature in the Cell 



DNA and the Evidence for Intelligent Design 



Stephen C. Meyer 



££ HarperCollins e-books 



For Elaine, Once and Future Companion 




Contents 



Chapter 1 
Chapter 2 
Chapter 3 
Chapter 4 
Chapter 5 
Chapter 6 
Chapter 7 
Chapter 8 
Chapter 9 
Chapter 10 
Chapter 1 1 
Chapter 12 
Chapter 13 
Chapter 14 
Chapter 15 
Chapter 16 
Chapter 17 
Chapter 18 
Chapter 19 
Chapter 20 



Prologue 

DNA, Darwin, and the Appearance of Design 
The Evolution of a Mystery and Why It Matters 
The Double Helix 
Signature in the Cell 
The Molecular Labyrinth 

The Origin of Science and the Possibility of Design 
Of Clues to Causes 

Chance Elimination and Pattern Recognition 
Ends and Odds 
Beyond the Reach of Chance 
Self-Organization and Biochemical Predestination 
Thinking Outside the Bonds 

Chance and Necessity, or the Cat in the Hat Comes Back 

The RNA World 

The Best Explanation 

Another Road to Rome 

But Does It Explain? 

But Is It Science? 

Sauce for the Goose 
Why It Matters 

Epilogue: A Living Science 

Appendix A: Some Predictions of Intelligent Design 



Appendix B: Multiverse Cosmology and the Origin of Life 
Notes 

Bibliography 

Searchable Terms 

Acknowledgments 

About the Author 

Praise 

Credits 

Copyright 

About the Publisher 



Prologue 



“Dad, that’s you!” my fourteen-year-old son exclaimed as he looked at the newspaper 
while we stood waiting to check out at the tiny general store. His shock at seeing my face 
in the front section of the Seattle Post-Intelligencer, when he just went to look for 
baseball scores, was no doubt compounded by his awareness of our location. 1 The 
general store on Shaw Island, one of the most remote in the San Juan chain north of Puget 
Sound, was the only commercial establishment on the island. This irony was not lost on 
my wife, whose raised eyebrow said it all. “I thought we were coming here to get away 
from all of this.” We were. But then how was I to know that the local Seattle paper would 
rerun the previous day’s front-page story from the New York Times about the program of 
scientists I directed and the controversy surrounding our work? 

The controversy about the origin of life and whether it arose from an undirected material 
process or from some kind of designing intelligence is not new. It goes back in Western 
civilization at least as far as the ancient Greeks, who produced philosophers representing 
both schools of thought. But the controversy over the contemporary theory of intelligent 
design (ID) and its implied challenge to orthodox evolutionary theory became big news 
beginning in 2004 and 2005. And, for better or worse, I found myself right in the middle 
of it. 

Three events sparked intense media interest in the subject. First, in August 2004, a 
technical journal housed at the Smithsonian Institution in Washington, D.C., called the 
Proceedings of the Biological Society of Washington published the first peer-reviewed 
article explicitly advancing the theory of intelligent design in a mainstream scientific 
periodical. After the publication of the article, the Smithsonian’s Museum of Natural 
History erupted in internal controversy, as scientists angry with the editor — an 
evolutionary biologist with two earned Ph.D.’s — questioned his editorial judgment and 
demanded his censure. Soon the controversy spilled over into the scientific press as news 
stories about the article and editor’s decision appeared in Science, Nature, The Scientist, 
and the Chronicle of Higher Education. 

The media exposure fueled further embarrassment at the Smithsonian, resulting in a 
second wave of recriminations. The editor, Richard Sternberg, lost his office and his 
access to scientific samples and was later transferred to a hostile supervisor. After 
Sternberg’s case was investigated by the U.S. Office of Special Counsel, a government 
watchdog organization, and by the U.S. House Committee on Government Reform, a 
congressional committee, other questionable actions came to light. 4 Both investigations 
found that senior administrators at the museum had interrogated Sternberg’s colleagues 
about Sternberg’s religious and political beliefs and fomented a misinformation campaign 
designed to damage his scientific reputation and encourage his resignation. 5 Sternberg 
did not resign his research appointment, but he was eventually demoted. 



As word of his mistreatment spread, the popular press began to run stories about his case. 
Ordinarily, my reaction to such reports might have been to shake my head in dismay and 
move on to the next story in the news cycle. But in this case, I couldn’t. As it happened, I 
was the author of the offending article. And some of the reporters interested in 
Sternberg’s mistreatment were coming to me with questions. They wanted to know more 
about the theory of intelligent design and why it had provoked such alarm among 
establishment scientists. 

Then in December 2004, two other events generated worldwide interest in the theory of 
intelligent design. First, a renowned British philosopher, Antony Flew, announced that he 
had repudiated a lifelong commitment to atheism, citing, among other factors, evidence 
of intelligent design in the DNA molecule. 6 Flew noted in his announcement that his 
views about the origin of life bore a striking resemblance to those of “American design 
theorists.” Again, intelligent design was in the news. But what was it? This time I found 
myself on the BBC debating a prominent evolutionary biologist about the theory. 

Later in the month, the American Civil Liberties Union (ACLU) announced a suit against 
a school board in the western Pennsylvania town of Dover. The school board had just 
announced its intention to let high school students learn about the theory of intelligent 
design. To do this, it proposed to inform students about the existence of a book in the 
school library — one that made the case for intelligent design in opposition to the standard 
evolutionary theories presented in the existing biology textbooks. When the ACLU 
announced its own intentions to sue, the national media descended upon the town en 
masse. 

The press corps covering the story no doubt already knew about the 1925 Scopes 
“monkey trial” from the fictionalized Spencer Tracy movie Inherit the Wind, if from no 
other source. In Dover they sensed they had the makings of a sequel. During 2005, all the 
major American network and cable news programs ran segments about the theory of 
intelligent design, the Dover controversy, or both. Stories not only appeared in major 
U.S. newspapers, but in papers around the world, from the Times of London, Sekai Nippo 
(Tokyo), the Times of India, and Der Spiegel to the Jerusalem Post. 

Then in August 2005, just as an end to the media buzz seemed near, a number of political 
and religious leaders — including figures as diverse as the Dalai Lama, President George 
W. Bush, and the pope — made public statements supportive of either intelligent design or 
allowing students to leam about the controversy surrounding it. When Time magazine 
followed suit with a cover story about the controversy, our phones started ringing all over 
again. 

As summer was drawing to an end, my wife and I decided it was time for our family to 
get away after friends offered us the use of their island cabin. But in the two-week period 
corresponding to our vacation, the New York Times ran its two front-page stories about 
our program at the Discovery Institute, the Washington Post broke a story about the latest 
developments in the Sternberg case, and the New York Times editorial page offered 
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criticism of Sternberg in its main staff-written editorial. After Sternberg decided to 
appear on The O ’Reilly Factor to tell his side of the story, we knew it was time to head 
back to Seattle. 8 

My temporary notoriety provided something my colleagues and I sorely needed — a 
platform for correcting much of the misinformation circulating about the theory of 
intelligent design. Many news articles and reports confused intelligent design with 
biblical creationism and its literal reading of the book of Genesis. Other articles echoed 
the talking points of our critics and portrayed our work as either “giving up on science” 
or a sneaky attempt to circumvent the legal prohibitions against teaching creationism in 
the public schools that the Supreme Court had enacted in 1987. 

Yet I knew that the modem theory of intelligent design was not developed as a legal 
strategy, still less as one to abet creationism. Instead, it was first considered in the late 
1970s and early 1980s by a group of scientists — Charles Thaxton, Walter Bradley, and 
Roger Olsen — as a possible explanation for an enduring mystery of modern biology: the 
origin of the digital infonnation encoded along the spine of the DNA molecule. 9 

As I explained repeatedly to reporters and cable-news hosts, the theory of intelligent 
design is not based on a religious text or document, even if it does have implications that 
support theistic belief (a point to which I will return in Chapter 20). Instead, intelligent 
design is an evidence-based scientific theory about life’s origins that challenges strictly 
materialistic views of evolution. 

Indeed, the theory of intelligent design challenges a specific tenet of contemporary 
evolutionary theory. According to modern neo-Darwinists such as Oxford’s Richard 
Dawkins, living systems “give the appearance of having been designed for a purpose.” 
But, to Dawkins and other contemporary Darwinists, that appearance of design is entirely 
illusory, because wholly undirected processes such as natural selection and random 
mutations can produce the intricate design-like structures in living systems. In their view, 
natural selection can mimic the powers of a designing intelligence without being guided 
or directed in any way. 

In contrast, the theory of intelligent design holds that there are tell-tale features of living 
systems and the universe that are best explained by an intelligent cause — that is, by the 
conscious choice of a rational agent — rather than by an undirected process. Either life 
arose as the result of purely undirected processes, or a guiding intelligence played a role. 
Advocates of intelligent design argue for the latter option based on evidence from the 
natural world. The theory does not challenge the idea of evolution defined as change over 
time or even common ancestry, but it does dispute the Darwinian idea that the cause of 
all biological change is wholly blind and undirected. Even so, the theory is not based on 
biblical doctrine. Intelligent design is an inference from scientific evidence, not a 
deduction from religious authority. 




Despite the opportunity I had been given in the media to clarify our position, my 
experiences left me with a sense of unfinished business. By 2005, I had devoted nearly 
twenty years of my life to developing a case for intelligent design based upon the 
discovery of the information-bearing properties — the digital code — stored in the DNA 
molecule. I had written a series of scientific and philosophical articles developing this 
idea, 10 but these articles were neither particularly accessible nor gathered into one 
volume. Now I repeatedly found myself in the position of having to defend an argument 
in sound bites that my audience did not know well enough to evaluate. How could they? 
Perhaps the central argument for intelligent design, the one that first induced me to 
consider the hypothesis, had not been explained adequately to a general, scientifically 
literate audience. 

Of course, by 2005 many excellent books and articles — including several important 
peer-reviewed books — had already been published on different aspects of the theory of 
intelligent design. In 1996, Lehigh University biochemist Michael Behe made a detailed 
case for intelligent design based upon the discovery of nanotechnology in cells — such as 
the now famous bacterial flagellar motor with its thirty-part rotary engine. Behe’s 
Darwin ’s Black Box sold over a quarter of a million copies and almost single-handedly 
put the idea of intelligent design on the cultural and scientific map. In 1998, William 
Dembski, a mathematician and philosopher with two Ph.D.’s (including one from the 
University of Chicago), followed suit by publishing a groundbreaking work on methods 
of design detection. Dembski’s work, The Design Inference, published by Cambridge 
University Press, established a scientific method for distinguishing the effects of 
intelligence from the effects of undirected natural processes. His work established 
rigorous indicators of intelligent design, but did not make any specific argument for 
intelligent design based on the presence of these indicators in living organisms. 

These were seminal works, but I had become convinced of intelligent design by another 
route. Over the years, I began to develop a related, but largely independent, case for 
intelligent design. Unfortunately I had a penchant for writing long, dense essays in 
obscure journals and anthologies. Even my article in the Proceedings of the Biological 
Society of Washington attracted more attention because of the controversy at the 
Smithsonian than because of controversy over the argument itself, though there had been 
more than a bit of that in some scientific circles. 1 1 

In any case, when the national media came calling, I simply could not get them to report 
why I thought DNA pointed to intelligent design. Reporters refused to cover the 
argument in their articles or backgrounders; debate partners scrupulously avoided 
responding to it, but instead continued to recite their talking points about the dangers of 
“intelligent design creationism.” Even the judge in the Dover case decided the scientific 
validity of intelligent design without considering the DNA evidence. 

Though I wasn’t too keen on having federal judges decide the merit of any scientific 
argument, let alone one that I favored, the Dover trial and its associated media coverage 
made me aware that I needed to make my argument in a more prominent way. Many 




evolutionary biologists had acknowledged that they could not explain the origin of the 
first life. Leading theories failed in large measure because they could not explain where 
the mysterious infonnation present in the cell came from. So it seemed there were no 
good counterarguments to the case I wanted to make. Yet various avoidance strategies 
continued to work because the argument did not have sufficient public prominence to 
force a response. Too few people in the public, the scientific community, and the media 
even knew about it. And yet it provided — arguably — one of the most important and 
fundamental reasons for considering intelligent design. 

None of this was actually too surprising. Since World War II, scientists have stressed the 
importance of publishing their work in specialized peer-reviewed journals, but 
throughout the history of science “paradigm-shifting” ideas and theories have typically 
been presented in books, including many that we might now call “trade press” (rather 
than academic) books. 

There are a couple of reasons for this. First, books allow scientists to make sustained and 
comprehensive arguments for synthetic new ideas. As the Italian philosopher of science 
Marcello Pera has shown, scientists often argue about competing interpretations of the 
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evidence. Although this is sometimes done successfully in short articles — as Einstein 
did in making his case for special and general relativity and Watson and Crick did in their 
nine-hundred-word article proposing a double helix structure for DNA — books have 
often been the go-to genre for presenting and evaluating new arguments for synthetic 
interpretations of a relevant body of evidence. 

Perhaps, the best-known example of this form of scientific discourse was provided by 
Charles Darwin himself, who famously described his work in On the Origin of Species by 
Means of Natural Section as “one long argument.” 13 There, Darwin proposed a 
comprehensive interpretation of many diverse lines of evidence. He also argued for the 
superior explanatory power of his theory and its two key propositions: (1) the creative 
power of natural selection and, (2) the descent of all life from a common ancestor. As 
part of his case, he also argued against the explanatory adequacy of rival interpretations 
of the evidence and refuted arguments for them. Other scientists such as Newton, 
Copernicus, Galileo, and Lyell as well as a host of lesser figures have used books to 
advance scientific arguments in favor of novel and comprehensive interpretations of the 
scientific evidence in their disciplines. 

There are other reasons that books are used to advance paradigm-shifting ideas. New 
scientific theories often synthesize a broad range of evidence from many related 
disciplines or subdisciplines of science. As such, they are often inherently 
interdisciplinary in scope. On the Origin of Species incorporated data from several 
disciplines, including embryology, paleontology, comparative anatomy, and 
biogeography. Modem scientific journals, typically focused as they are on topics within a 
narrowly defined subdiscipline, rarely permit the kind of comprehensive review and 
assessment of evidence that the advancement of a new interpretive framework requires. 




Additionally, by creating a larger audience for a new idea, a book, and particularly a 
popular trade book, can go over the heads of an entrenched establishment to force the 
reevaluation of an established theory by creating wider interest in its standing. Darwin 
did this by publishing On the Origin of Species by Means of Natural Selection with John 
Murray, a prominent trade press in Victorian England. Michael Behe has done this as 
well. By making a case for intelligent design based upon various examples of 
nanotechnology in the cell, Behe’s book focused international attention on the problem 
that complex systems have posed for neo-Darwinism. It also gave the theory of intelligent 
design public and, arguably, scientific standing. 

This book makes a case for that same idea. It does so, however, on the basis of a different 
class of evidence: the infonnation — the digital code — stored in DNA and the other large 
biological molecules. The case I make for intelligent design is less well known than 
Professor Behe’s and, therefore, to many completely new. Even so, it is not based upon a 
new discovery. It is, instead, based upon one of the most famous breakthroughs of 
modem biology: the discovery in 1953 of the infonnation-bearing capacities of the DNA 
molecule, what I call the “signature in the cell.” 

In 2005, when I was repeatedly placed in the position of defending the theory of 
intelligent design in the media, the argument that I most wanted to make in its favor had 
little public standing. I have written this book to remedy that deficiency. This book 
attempts to make a comprehensive, interdisciplinary argument for a new view of the 
origin of life. It makes “one long argument” for the theory of intelligent design. 

Before coming to work full-time at the Discovery Institute, I worked for twelve years as a 
college professor. In teaching I’ve found that it is often easier to understand a scientific 
theory if one can follow the historical progression of thought that led to its fonnulation. 
Following a story of discovery is not only more engaging, it can also illuminate the 
process of reasoning by which investigators came to their conclusions. For this reason, 
I’ve chosen to present my case for intelligent design in the context of a larger historical 
and personal narrative. 

Thus, Signature in the Cell does not just make an argument; it also tells a story, a mystery 
story and the story of my engagement with it. It tells about the mystery that has 
surrounded the discovery of the digital code in DNA and how that discovery has 
confounded repeated attempts to explain the origin of the first life on earth. Throughout 
the book I will call this mystery “the DNA enigma.” 

A brief word about the organization of the book: in Chapters 1 and 2 I define the 
scientific and philosophical issues at stake in the DNA enigma and give some historical 
background about the larger origin-of-life debate. In Chapters 3, 4, and 5 I describe the 
mystery surrounding DNA in more detail in order to establish what it is that any theory of 
the origin of life must explain. After a short interlude in Chapters 6 and 7 in which I 
examine what scientists in the past have thought about biological origins and how 
scientists currently investigate these questions, I examine (in Chapters 8 through 14) the 




competing explanations for the origin of biological infonnation. Then, in Chapters 15 and 
16, 1 present a positive case for intelligent design as the best explanation for the origin of 
the infonnation necessary to produce the first life. Finally, in Chapters 17 through 20, I 
defend the theory of intelligent design against various popular objections to it. In the 
Epilogue, I show that intelligent design offers a fruitful approach to future scientific 
research. Not only does it illuminate some very recent and surprising discoveries in 
genomics, but it also suggests productive new lines of scientific investigation for many 
subdisciplines of biology. 

My interest in the DNA enigma stretches back nearly twenty-five years. And though 
there were times (particularly in 2005) when I was frustrated with myself for not having 
already produced this work, my protracted production schedule has had at least two 
unintended advantages. First, it has given me the opportunity to engage in both private 
conversation and public debate with some of the leading scientific figures involved in this 
controversy. That has made it possible for me to present what I hope is an unusually 
thorough analysis of the competing explanations for the origin of the information in 
living cells. 

Second, because of the timing of its release, this book may contribute to the ongoing 
assessment of Darwin’s legacy just when many scientists, scholars, reporters, and others 
will be doing so. This year marks the 200th anniversary of Darwin’s birth and the 150th 
anniversary of the publication of On the Origin of Species. In the Origin, Darwin 
accomplished many things. He introduced a new framework for understanding the history 
of life. He identified a new mechanism of biological change. And, according to many 
scholars and scientists, he also refuted the scientific argument for design. He did this by 
explaining away any presumed vestiges of an actual designing intelligence, showing 
instead that these “appearances of design” had been produced by a purely undirected 
process — indeed, one that could mimic the powers of a designing mind. As evolutionary 
biologist Francisco Ayala has recently explained, Darwin explained the appearance of 
design without recourse to an actual designer. He gave us “design without a designer.” 14 
But is this really true? Even if we grant Darwin’s argument in the Origin, does it really 
follow that he refuted the design hypothesis? This book will present a fresh perspective 
on this question by examining one of the most enduring mysteries of modern biology. 
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DNA, Darwin, and the Appearance of Design 



When James Watson and Francis Crick elucidated the structure of DNA in 1953, they 
solved one mystery, but created another. 

For almost a hundred years after the publication of On the Origin of Species by Charles 
Darwin in 1859, the science of biology rested secure in the knowledge that it had 
explained one of humankind’s most enduring enigmas. From ancient times, observers of 
living organisms had noted that living things display organized structures that give the 
appearance of having been deliberately arranged or designed for a purpose, for example, 
the elegant fonn and protective covering of the coiled nautilus, the interdependent parts 
of the eye, the interlocking bones, muscles, and feathers of a bird wing. For the most part, 
observers took these appearances of design as genuine. Observations of such structures 
led thinkers as diverse as Plato and Aristotle, Cicero and Maimonides, Boyle and Newton 
to conclude that behind the exquisite structures of the living world was a designing 
intelligence. As Newton wrote in his masterpiece The Opticks: “How came the Bodies of 
Animals to be contrived with so much Art, and for what ends were their several parts? 
Was the Eye contrived without Skill in Opticks, and the Ear without Knowledge of 
Sounds?... And these things being rightly dispatch’d, does it not appear from Pham omen a 
that there is a Being incorporeal, living, intelligent...?” 1 

But with the advent of Darwin, modern science seemed able to explain this appearance of 
design as the product of a purely undirected process. In the Origin, Darwin argued that 
the striking appearance of design in living organisms — in particular, the way they are so 
well adapted to their environments — could be explained by natural selection working on 
random variations, a purely undirected process that nevertheless mimicked the powers of 
a designing intelligence. Since then the appearance of design in living things has been 
understood by most biologists to be an illusion — a powerfully suggestive illusion, but an 
illusion nonetheless. As Crick himself put it thirty-five years after he and Watson 
discerned the structure of DNA, biologists must “constantly keep in mind that what they 
see was not designed, but rather evolved.” 

But due in large measure to Watson and Crick’s own discovery of the 
information-bearing properties of DNA, scientists have become increasingly and, in some 
quarters, acutely aware that there is at least one appearance of design in biology that may 
not yet have been adequately explained by natural selection or any other purely natural 
mechanism. Indeed, when Watson and Crick discovered the structure of DNA, they also 
discovered that DNA stores information using a four-character chemical alphabet. Strings 
of precisely sequenced chemicals called nucleotide bases store and transmit the assembly 



instructions — the information — for building the crucial protein molecules and machines 
the cell needs to survive. 



Crick later developed this idea in his famous “sequence hypothesis,” according to which 
the chemical parts of DNA (the nucleotide bases) function like letters in a written 
language or symbols in a computer code. Just as letters in an English sentence or digital 
characters in a computer program may convey infonnation depending on their 
arrangement, so too do certain sequences of chemical bases along the spine of the DNA 
molecule convey precise instructions for building proteins. Like the precisely arranged 
zeros and ones in a computer program, the chemical bases in DNA convey information in 
virtue of their “specificity.” As Richard Dawkins notes, “The machine code of the genes 
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is uncannily computer-like.” Software developer Bill Gates goes further: “DNA is like a 
computer program but far, far more advanced than any software ever created .” 4 

But if this is true, how did the information in DNA arise? Is this striking appearance of 
design the product of actual design or of a natural process that can mimic the powers of a 
designing intelligence? As it turns out, this question is related to a long-standing mystery 
in biology — the question of the origin of the first life. Indeed, since Watson and Crick’s 
discovery, scientists have increasingly come to understand the centrality of infonnation to 
even the simplest living systems. DNA stores the assembly instructions for building the 
many crucial proteins and protein machines that service and maintain even the most 
primitive one-celled organisms. It follows that building a living cell in the first place 
requires assembly instructions stored in DNA or some equivalent molecule. As 
origin-of-life researcher Bemd-Olaf Kuppers explains, “The problem of the origin of life 
is clearly basically equivalent to the problem of the origin of biological information .” 5 





Figure 1.1. James Watson and Francis Crick at the Cavendish Laboratory in Cambridge. 
Courtesy of Barrington Brown/Photo Researchers, Inc. 



Much has been discovered in molecular and cell biology since Watson and Crick’s 
revolutionary discovery more than fifty years ago, but these discoveries have deepened 
rather than mitigated the enigma of DNA. Indeed, the problem of the origin of life (and 
the origin of the information needed to produce it) remains so vexing that Harvard 
University recently announced a $100 million research program to address it. 6 When 
Watson and Crick discovered the structure and information-bearing properties of DNA, 
they did indeed solve one mystery, namely, the secret of how the cell stores and transmits 
hereditary information. But they uncovered another mystery that remains with us to this 
day. This is the DNA enigma — the mystery of the origin of the information needed to 
build the first living organism. 




In one respect, of course, the growing awareness of the reality of infonnation within 
living things makes life seem more comprehensible. We live in a technological culture 
familiar with the utility of information. We buy infonnation; we sell it; and we send it 
down wires. We devise machines to store and retrieve it. We pay programmers and 
writers to create it. And we enact laws to protect the “intellectual property” of those who 
do. Our actions show that we not only value information, but that we regard it as a real 
entity, on par with matter and energy. 

That living systems also contain information and depend on it for their existence makes it 
possible for us to understand the function of biological organisms by reference to our 
own familiar technology. Biologists have also come to understand the utility of 
information, in particular, for the operation of living systems. After the early 1960s 
advances in the field of molecular biology made clear that the digital information in DNA 
was only part of a complex infonnation-processing system, an advanced fonn of 
nanotechnology that mirrors and exceeds our own in its complexity, storage density, and 
logic of design. Over the last fifty years, biology has advanced as scientists have come to 
understand more about how infonnation in the cell is stored, transferred, edited, and used 
to construct sophisticated machines and circuits made of proteins. 

The importance of information to the study of life is perhaps nowhere more obvious than 
in the emerging fields of genomics and bioinfonnatics. Over the last decade, scientists 
involved in these disciplines have begun to map — character by character — the complete 
sequence of the genetic instructions stored on the human genome and those of many other 
species. With the completion of the Human Genome Project in 2000, the emerging field 
of bioinfonnatics entered a new era of public interest. News organizations around the 
world earned President Clinton’s announcement of the project’s completion on the White 
House lawn as Francis Collins, scientific director of the project, described the genome as 
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a “book,” a repository of “instructions,” and the “book of life.” The Human Genome 
Project, perhaps more than any discovery since the elucidation of the structure of DNA in 
1953, has heightened public awareness of the importance of information to living things. 
If Watson and Crick’s discovery showed that DNA stores a genetic text, Francis Collins 
and his team took a huge step toward deciphering its message. Biology has irrevocably 
entered an information age. 

In another way, however, the reality of information within living things makes life seem 
more mysterious. For one thing, it is difficult to understand exactly what information is. 
When a personal assistant in New York types a dictation and then prints and sends the 
result via fax to Los Angeles, some thing will arrive in L.A. But that thing — the paper 
coming out of the fax machine — did not originate in New York. Only the infonnation on 
the paper came from New York. No single physical substance — not the air that carried 
the boss’s words to the dictaphone, or the recording tape in the tiny machine, or the paper 
that entered the fax in New York, or the ink on the paper coming out of the fax in Los 
Angeles — traveled all the way from sender to receiver. Yet something did. 




The elusive character of information — whether biological or otherwise — has made it 
difficult to define by reference to standard scientific categories. As evolutionary biologist 
George Williams notes, “You can speak of galaxies and particles of dust in the same 
tenns because they both have mass and charge and length and width. [But] you can’t do 
that with infonnation and matter.” 8 A bla nk magnetic tape, for example, weighs just as 
much as one “loaded” with new software — or with the entire sequence of the human 
genome. Though these tapes differ in infonnation content (and value), they do not do so 
because of differences in their material composition or mass. As Williams concludes, 
“Information doesn’t have mass or charge or length in millimeters. Likewise matter 
doesn’t have bytes.... This dearth of shared descriptors makes matter and information 
two separate domains.” 9 

When scientists during the late 1940s began to define information, they did not make 
reference to physical parameters such as mass, charge, or watts. Instead, they defined 
information by reference to a psychological state — the reduction of uncertainty — which 
they proposed to measure using the mathematical concept of probability. The more 
improbable a sequence of characters or signals, the more uncertainty it reduces, and thus 
the more infonnation it conveys. 10 

Not surprisingly, some writers have come close to equating information with thought 
itself. The infonnation technology guru George Gilder, for example, notes that 
developments in fiber optics have allowed more and more information to travel down 
smaller and smaller (and lighter and lighter) wires. Thus, he notes that as technology 
advances, we convey ever more thought across ever less matter — where the numerator in 
that ratio, namely, thought, corresponds precisely to information. 1 1 

So should we think of information as thought — as a kind of mental chimera etched in 
stone or burned onto compact discs? Or can we define information less abstractly as, 
perhaps, just an improbable arrangement of matter? 

Whatever information is — whether thought or an elaborate arrangement of matter — one 
thing seems clear. What humans recognize as infonnation certainly originates from 
thought — from conscious or intelligent activity. A message received via fax by one 
person first arose as an idea in the mind of another. The software stored and sold on a 
compact disc resulted from the design of a software engineer. The great works of 
literature began first as ideas in the minds of writers — Tolstoy, Austen, or Donne. Our 
experience of the world shows that what we recognize as information invariably reflects 
the prior activity of conscious and intelligent persons. 

What, then, should we make of the presence of infonnation in living organisms? The 
Human Genome Project, among many other developments in modern biology, has 
pressed this question to the forefront of public awareness. We now know that we do not 
just create information in our own technology; we also find it in our biology — and, 
indeed, in the cells of every living organism on earth. But how did this information arise? 




And what does the presence of infonnation in even the simplest living cell imply about 
life and its origin? Who or what “wrote” the book of life? 

The infonnation age in biology officially began in the mid-1950s with the elucidation of 
the chemical structure and information-bearing properties of DNA (deoxyribonucleic 
acid) — the molecule of heredity. Beginning in 1953 with their now famous 
communication to the British scientific journal Nature, James Watson and Francis Crick 
identified DNA as the molecular repository of genetic information. 1 ” Subsequent 
developments in the field of molecular biology confirmed this idea and showed that the 
precisely sequenced bases attached to the helical backbone of DNA store the information 
for building proteins — the sophisticated enzymes and machines that service the cells in 
all living things. 

Though the discovery of the information-bearing properties of DNA dates back over a 
half century, the recognition of the full significance of this discovery has been slow in 
coming. Many scientists have found it difficult to relinquish an exclusive reliance upon 
the more traditional scientific categories of matter and energy. As George Williams 
(himself an evolutionary biologist) notes, “Evolutionary biologists have failed to realize 
that they work with two more or less incommensurable domains: that of information and 
that of matter.... The gene is a package of information, not an object. The pattern of base 
pairs in a DNA molecule specifies the gene. But the DNA molecule is the medium, it’s 
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not the message.” 

Yet this recognition begs deeper questions. What does it mean when we find information 
in natural objects — living cells — that we did not ourselves design or create? As the 
information theorist Hubert Yockey observes, the “genetic code is constructed to confront 
and solve the problems of communication and recording by the same principles 
found... in modem communication and computer codes.” Yockey notes that “the 
technology of infonnation theory and coding theory has been in place in biology for at 
least 3.85 billion years,” or from the time that life first originated on earth. 14 What should 
we make of this fact? How did the information in life first arise? 

Our commonsense reasoning might lead us to conclude that the information necessary to 
the first life, like the infonnation in human technology or literature, arose from a 
designing intelligence. But modem evolutionary biology rejects this idea. Many 
evolutionary biologists admit, of course, that living organisms “appear to have been 
carefully and artfully designed,” as Richard Lewontin puts it. 15 As Richard Dawkins 
states, “Biology is the study of complex things that appear to have been designed for a 
purpose.” 16 Nevertheless, Lewontin and Dawkins, like evolutionary biologists generally, 
insist that the appearance of design in life is illusory. Life, they say, looks designed, but 
was not designed by an actual intelligent or purposive agent. 



Darwin’s Designer Substitute 




Why do evolutionary biologists so confidently assert that the appearance of design in 
living organisms is illusory? Of course, the answer to this question is well known. 
Evolutionary biologists have a theory that can apparently explain, or explain away, the 
appearance of design without invoking an actual designer. According to classical 
Darwinism, and now modern neo-Darwinism, the mechanism of natural selection acting 
on random variations (or mutations) can mimic the effects of intelligence, even though 
the mechanism is, of course, entirely blind, impersonal, and undirected. 




Figure 1.2. English naturalist Charles Robert Darwin (1809-82), age seventy-two. 
Courtesy of SPL/Photo Researchers, Inc. 



Darwin developed his principle of natural selection by drawing on an analogy with 
artificial selection: the process of selective breeding to change the characteristics 
(whether anatomical, physiological, or behavioral) of a group of organisms. For example, 
a farmer might observe that some of his young stallions are faster than others. If he 
allows only the fastest of these to breed with the fastest mares, then, after several 
generations of selective breeding, he will own a small group of speedy “thoroughbreds” 
suitable for racing on the Downs. 

Darwin realized that nature could imitate this process of selective breeding. The presence 
of unusually fast predatory wild cats would imperil all but the fastest horses in a wild 



herd. After several generations of such predatory challenge, the speed of the remaining 
herd might exhibit a discemable increase. Thus, environmental forces (predators, changes 
in weather, competition for food, etc.) could accomplish the work of a human breeder. By 
causing a population to adapt to its environment, blind forces of nature could come to 
mimic, over time, the action of a selecting or designing intelligence. 

Yet if natural selection, as Darwin called this process, could improve the speed of a horse 
or an antelope, why couldn’t it also produce those animals in the first place? “Reason,” 
wrote Darwin “ought to conquer... imagination” 18 — namely, our incredulity about the 
possibility of such happenings and our impression that living things appear to have been 
designed. According to Darwin, if given enough time, nature’s selective power might act 
on any variation perfecting any structure or function far beyond what any human could 
accomplish. Thus, the complex systems in life that we reflexively attribute to intelligence 
have wholly natural causes. As Darwin explained, “There seems to be no more design in 
the variability of organic beings, and in the action of natural selection, than in the course 
which the wind blows.” 19 Or as evolutionary biologist Francisco Ayala explains, “The 
functional design of organisms and their features would... seem to argue for the existence 
of a designer. It was Darwin’s greatest accomplishment [however] to show that the 
directive organization of living beings can be explained as the result of a natural process, 
natural selection, without any need to resort to a Creator or other external agent.” 0 Thus, 
Ayala and other Darwinian biologists not only affirm that natural selection can produce 
“design without a designer,” they also assert that it is “creative without being 
conscious.” 21 

The Appearance of Design 

To many outside evolutionary biology, the claim that design arises without a designer 
may seem inherently contradictory. Yet, in theory at least, the possibility that life is not 
what it seems represents nothing particularly unusual. Science often shows that our 
perceptions of nature do not match reality. A straight pencil appears bent when inserted 
in a glass of water; the sun appears to circle the earth; and the continents appear 
immobile. Perhaps, living organisms only appear to be designed. 

Even so, there is something curious about the scientific denial of our ordinary intuition 
about living things. For almost a hundred and fifty years, since its putative explanation by 
Darwinian theory, this impression of design persists as incorrigibly as ever. Public 
opinion polls suggest that nearly 90 percent of the American public does not accept the 
full-fledged neo-Darwinian account of evolution with its denial of any role for a 
purposeful creator. Though many of these people accept some form of evolutionary 
change and have a high view of science generally, they apparently cannot bring 
themselves to repudiate their deepest intuitions and convictions about the design of the 
living world. In every generation since the 1860s, scientific critics of Darwinism and 
neo-Darwinism have arisen marshaling serious evidential objections to the theory. Since 
the 1980s a growing number of scientists and scholars have expressed deep reservations 
about both biological and chemical evolutionary theory, each with their implicit denial of 




design. And even orthodox evolutionary biologists admit the overwhelming impression 
of design in modern organisms. To quote Francis Crick again, “Biologists must 
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constantly keep in mind that what they see was not designed, but rather evolved.” 

Perhaps more curiously, modem biologists can scarcely describe living organisms 

without resorting to language that seems to imply the very thing they explicitly deny: 

intentional and purposive design. As philosopher of science Michael Ruse notes, 

biologists ask about “the purpose of the fins on the back of the stegosaurus” or “the 

function of the bird’s feathers” and discuss whether “the Irish elk’s antlers did or did not 

exist in order to intimidate rivals.” “It is true,” Ruse continues, “that during the 

nineteenth century [some physicists] suggested that the moon exists in order to light the 

way home of lonely travelers, but no physicist would use such language today. In 

biology, however, especially evolutionary biology, this kind of talk is commonplace.” He 

concludes, “The world of the evolutionist is drenched in the anthropomorphism of 

intention.” And yet “paradoxically, even the severest critics” of such intentional language 
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slip into it “for the sake of convenience.”" 

In theory, at least, the use of such metaphor in science derives from ignorance. Physicists 
talk about gravitational “attraction,” because they don’t really know what causes action at 
a distance. Metaphors reign where mystery resides. Yet, on these grounds, we might have 
expected that as biology advanced, as new discoveries explicated the molecular basis of 
biological functions, biology’s reliance upon the language of purpose, upon teleological 
metaphor, might have diminished. Yet the very opposite has taken place. The advent of 
the most reductionistic subdiscipline of modern biology — molecular biology — has only 
deepened our dependence on teleological language. 

In fact, molecular biologists have introduced a new “high-tech” teleology, taking 
expressions, often self-consciously, from communication theory, electrical engineering, 
and computer science. The vocabulary of modern molecular and cell biology includes 
apparently accurate descriptive terms that nevertheless seem laden with a “meta-physics 
of intention”: “genetic code,” “genetic information,” “transcription,” “translation,” 
“editing enzymes,” “signal-transduction circuitry,” “feedback loop,” and 
“information-processing system.” As Richard Dawkins notes, “Apart from differences in 
jargon, the pages of a molecular-biology journal might be interchanged with those of a 
computer-engineering journal .” 25 As if to underscore the point, University of Chicago 
cell biologist James Shapiro describes the integrated system of proteins that constitutes 
the mammalian blood-clotting system “as a powerful real-time distributed computing 
system.” In the same context he notes that many biochemical systems within the cell 
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resemble “the wiring diagram for an electronic circuit.” As the historian of biology 

Timothy Lenoir observes, “Teleological thinking has been steadfastly resisted by modem 

biology. And yet in nearly every area of research, biologists are hard pressed to find 
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language that does not impute purposiveness to living forms.” 



Thus, it seems that an acquaintance with biological organisms, to say nothing of the 
molecular biology of the cell, leads even those who repudiate design to use language that 




seems incompatible with their own reductionistic and Darwinian perspective — with their 
official denial of actual design. Although this may ultimately signify nothing, it does at 
least raise a question. Does the persistence of our perception of design, and the use of 
incorrigibly teleological language, indicate anything about the origin of life or the 
adequacy of scientific theories that deny (actual) design in the origin of living systems? 

As always, in science the answer to such questions depends entirely on the justification 
that scientists can provide for their theories. Intuitions and perceptions can be right or 
wrong. It might well be, as many in biology assure us, that public and even scientific 
doubts about evolutionary theory derive solely from ignorance or religious prejudice, and 
that teleological language reflects nothing more than a metaphor of convenience, like 
saying the sun has set behind the horizon. Yet the persistence of dissenting scientific 
opinion and the inability of biologists to avoid the language of purpose raise a pardonable 
curiosity. Have evolutionary biologists discovered the true cause of the appearance of 
design in living systems, or should we look for another? Should we trust our intuitions 
about living organisms or accept the standard evolutionary account of biological origins? 



The Origin of Biological Information 

Consider the following sequence of letters: 

AGTCTGGGACGCGCCGCCGCCATGATCATCCCTGTACGCTGCTTCACTTGT 
GGCA AGAT CGTCGGC A AC A AGT GGGAGGCTT ACCT GGGGCT GCT GCAGGC 
CGAGT AC ACCGAGGGGT GAGGCGCGGGCCGGGGCT AGGGGCT GAGT CCGC 
CGT GGGGCGCGGGCCGGGGCT GGGGGCT GAGT CCGCCCTGGGGT GCGCGC 
CGGGGCGGGAGGCGCAGCGCT GCCT GAGGCCAGCGCCCCAT GAGC AGCT 
TCAGGCCCGGCTTCTCCAGCCCCGCTCTGTGATCTGCTTTCGGGAGAACC 

This string of alphabetic characters looks as if it could be a block of encoded information, 
perhaps a section of text or machine code. That impression is entirely correct, for this 
string of characters is not just a random assortment of the four letters A, T, G, and C, but 
a representation of part of the sequence of genetic assembly instructions for building a 
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protein machine — an RNA polymerase — critical to gene expression (or information 
processing) in a living cell. 

Now consider the following string of characters: 



01010111011010000110010101101110001000000110100101 
1011 1000100000011 10100011010000110010100100000010 
0001101101111011101010111001001110011011001010010 
00000110111101100110001000000110100001110101011011 
0101100001011011100010000001100101011101100110010 
1011011100111010001110011001000000110100101110100 




This sequence also appears to be an infonnation-rich sequence, albeit written in binary 
code. As it happens, this sequence is also not just a random array of characters, but the 
first words of the Declaration of Independence (“When in the course of human 
events...”) written in the binary conversion of the American Standard Code for 
Infonnation Interchange (ASCII). In the ASCII code, short specified sequences of zeros 
and ones correspond to specific alphabetic letters, numerals, or punctuation marks. 

Though these two blocks of encoded infonnation employ different conventions (one uses 
the ASCII code, the other the genetic code), both are complex, nonrepeating sequences 
that are highly specified relative to the functional or communication requirements that 
they perform. This similarity explains, in part, Dawkins’s observation that, “The machine 
code of the genes is uncannily computer-like.” Fair enough. But what should we make of 
this similarity between infonnational software — the undisputed product of conscious 
intelligence — and the infonnational sequences found in DNA and other important 
biomolecules? 



Introduction to an Enigma 

I first encountered the DNA enigma as a young scientist in Dallas, Texas, in 1985. At the 
time, I was working for one of the big multinational oil companies. I had been hired as an 
exploration geophysicist several years earlier just as the price of oil had spiked and just as 
I was graduating from college with degrees in physics and geology. My job, as the Texas 
oilmen put it, was to “look for awl out in the guff.'" 

Though I had been a physics and geology student, I had enough exposure to biology to 
know what DNA did. I knew that it stored the instruction set, the information, for 
building proteins in the cell and that it transmitted hereditary traits in living things using 
its four-character chemical alphabet. Even so, like many scientists I had never really 
thought about where DNA — or the information it contained — came from in the first 
place. If asked, I would have said it had something to do with evolution, but I couldn’t 
have explained the process in any detail. 

On February 10, 1985, I learned that I wasn’t the only one. On that day I found myself 
sitting in front of several world-class scientists who were discussing a vexing scientific 
and philosophical question: How did the first life on earth arise? As recently as the 
evening before, I had known nothing about the conference where this discussion was now 
taking place. I had been attending another event in town, a lecture at the Southern 
Methodist University by a Harvard astronomer discussing the big-bang theory. There I 
learned of a conference taking place the following day that would tackle three big 
scientific questions — the origin of the universe, the origin of life, and the nature of human 
consciousness. The conference would bring together scientists from competing 
philosophical perspectives to grapple with each of these issues. The next morning I 
walked into the downtown Hilton where the conference was being held and heard an 
arresting discussion of what scientists knew they didn’t know. 




I was surprised to learn — contrary to what I had read in many text-books — that the 
leading scientific experts on the origin of life had no satisfactory explanation for how life 
had first arisen. These experts, many of whom were present that weekend in Dallas, 
openly acknowledged that they did not have an adequate theory of what they called 
“chemical evolution,” that is, a theory of how the first living cell arose from simpler 
chemicals in the primordial ocean. And from their discussions it was clear that 
DNA — with its mysterious arrangements of chemical characters — was a key reason for 
this impasse. 

The discussion changed the course of my professional life. By the end of that year, I was 
preparing to move to the University of Cambridge in England, in part to investigate 
questions I first encountered on that day in February. 

On its face, my change of course looked like a radical departure from my previous 
interests, and that’s certainly how my friends and family took it. Oil-company geophysics 
was a highly practical, commercially relevant fonn of applied science. A successful study 
of the subsurface of the earth could net the company millions of dollars of revenue from 
the resulting discovery of oil and gas. The origin of life, however, was a seemingly 
intractable — even arcane — theoretical question, with little or no direct commercial or 
practical import. 

Nevertheless, at the time, the transition seemed entirely natural to me. Perhaps it was 
because I had long been interested in scientific questions and discoveries that raised 
larger philosophical issues. In college, I had taken many philosophy courses while 
pursuing my scientific training. But perhaps it was what I was doing at the oil company 
itself. By the 1980s looking for oil required the use of sophisticated computer-assisted 
seismic-imaging techniques, at the time a cutting-edge form of infonnation technology. 
After sending artificial seismic waves down into the earth, geophysicists would time the 
resulting echoes as they traveled back to the surface and then use the information from 
these signals to reconstruct a picture of the subsurface of the earth. Of course, at every 
stage along the way we depended heavily on computers and computer programs to help 
us process and analyze the information we received. Perhaps what I was learning about 
how digital information could be stored and processed in machines and about how digital 
code could direct machines to accomplish specific tasks made life itself — and the digital 
code stored in its DNA — seem less mysterious. Perhaps this made the problem of the 
origin of life seem more scientifically tractable and interesting. In any case, when I 
learned of the enigma confronting origin-of-life researchers and why DNA was central to 
it, I was hooked. 

A controversy that erupted at the conference added to my sense of intrigue. During a 
session on the origin of life, the scientists were discussing where the information in DNA 
had come from. How do chemicals arrange themselves to produce code? What introduced 
drama into what might have otherwise been a dry academic discussion was the reaction 
of some of the scientists to a new idea. Three of the scientists on the panel had just 
published a controversial book called The Mystery of Life ’s Origin with a prominent New 




York publisher of scientific monographs. Their book provided a comprehensive critique 
of the attempts that had been made to explain how the first life had arisen from the 
primordial ocean, the so-called prebiotic soup. These scientists, Charles Thaxton, Walter 
Bradley, and Roger Olsen, had come to the conclusion that all such theories had failed to 
explain the origin of the first life. Surprisingly, the other scientists on the panel — all 
experts in the field — did not dispute this critique. 

What the other scientists did dispute was a controversial new hypothesis that Thaxton and 
his colleagues had floated in the epilogue of their book in an attempt to explain the DNA 
enigma. They had suggested that the information in DNA might have originated from an 
intelligent source or, as they put it, an “intelligent cause.” Since, in our experience, 
information arises from an intelligent source, and since the information in DNA was, in 
their words, “mathematically identical” to the information in a written language or 
computer code, they suggested that the presence of information in DNA pointed to an 
intelligent cause. The code, in other words, pointed to a programmer. 

That was where the fireworks started. Other scientists on the panel became 
uncharacteristically defensive and hostile. Dr. Russell Doolittle, of the University of 
California at San Diego, suggested that if the three authors were not satisfied with the 
progress of origin-of-life experiments, then they should “do them.” Never mind that 
another scientist on the panel who had favored Thaxton’ s hypothesis, Professor Dean 
Kenyon, of San Francisco State University, was a leading origin-of-life researcher who 
had himself performed many such experiments. It was clear that Doolittle regarded the 
three scientists, despite their strong credentials, as upstarts who had violated some 
unspoken convention. Yet it was also clear, to me at least, that the authors of the new 
book had seized the intellectual initiative. They had offered a bold new idea that seemed 
at least intuitively plausible, while those defending the status quo offered no plausible 
alternative to this new explanation. Instead, the defenders of the status quo were forced to 
accept the validity of the new critique. All they could do was accuse the upstarts of 
giving up too soon and plead for more time. 

I left deeply intrigued. If my sense of the scientific status of the problem was accurate — if 
there was no accepted or satisfactory theory of the origin of the first life — then a mystery 
was at hand. And if it was the case that evolutionary theory could not explain the origin 
of the first life because it could not explain the origin of the genetic information in DNA, 
then something that we take for granted was quite possibly an important clue in a mystery 
story. DNA with its characteristic double -helix shape is a cultural icon. We see the helix 
in everything from music videos and modem art to science documentaries and news 
stories about criminal proceedings. We know that DNA testing can establish guilt, 
innocence, paternity, and distant genealogical connections. We know that DNA research 
holds the key to understanding many diseases and that manipulating DNA can alter the 
features of plants and animals and boost food production. Most of us know roughly what 
DNA is and what it does. But could it be that we do not know anything about where it 
came from or how it was first fonned? 





Figure 1.3. Charles Thaxton. Printed by permission from Charles Thaxton. 



The controversy at the conference served to awaken me to the strange combination of 
familiarity and mystique that surrounds the double helix and the digital code it contains. 
In the wake of the conference, I learned that one of the scientists who participated in the 
origin-of-life discussion was living in Dallas. It was none other than Charles Thaxton, the 
chemist who with his coauthors had proposed the controversial idea about an intelligence 
playing a role in the origin of biological information. I called him, and he offered to meet 
with me. We began to meet regularly and talk, often long after work hours. As I learned 
more about his critique of “origin-of-life studies” and his ideas about DNA, my interest 
in the DNA enigma grew. 

These were heady and exciting days for me as I first encountered and grappled with these 
new ideas. If Thaxton was right, then the classical design argument that had been 
dismissed first by Enlightenment philosophers such as David Hume in the eighteenth 
century and then later by evolutionary biologists in the wake of the Darwinian revolution 
might have legitimacy after all. On a visit back home to Seattle, I described what I had 
been learning to one of my earlier college mentors whose critical faculties I greatly 
respected, a philosophy professor named Norman Krebbs. He surprised me when he told 
me that the scientific idea I was describing was potentially one of the most significant 
philosophical developments in three hundred years of Western thought. Could the design 




argument be resuscitated based upon discoveries in modern science? And was DNA the 
key? 

As intriguing as this new line of thinking was for me, I had a growing list of questions. I 
wondered, what exactly is information in a biological context? When biologists referred 
to the sequences of chemicals in the DNA molecule as “information,” were they using the 
tenn as a metaphor? Or did these sequences of chemicals really function in the same way 
as “code” or “text” that humans use? If biologists were using the term merely as a 
metaphor, then I wondered whether the genetic infonnation designated anything real and, 
if not, whether the “information” in DNA could be said to point to anything, much less an 
“intelligent cause.” 

But even if the information in DNA was in some important sense similar to the 
information that human agents devise, it didn’t necessarily follow that a prior intelligent 
cause was the only explanation of such infonnation. Were there causes for information 
that had not yet been considered at the conference that day? Maybe some other cause of 
information would be discovered that could provide a better explanation for the 
information necessary for the origin of life. In short, I wondered, is there really evidence 
for the intelligent design of life, and if so, just how strong is that evidence? Was it, 
perhaps, scientifically premature or inappropriate to consider such a radical possibility, as 
Thaxton’s critics had suggested? 

My concerns about this were heightened because of some of the things that Thaxton and 
his colleagues had written to justify their conclusion. The Mystery of Life ’s Origin had 
made the radical claim that an intelligent cause could be considered a legitimate scientific 
hypothesis for the origin of life. To justify this claim Thaxton and colleagues argued that 
a mode of scientific inquiry they called origins science allowed for the postulation of 
singular acts of intelligence to explain certain phenomena. Thaxton and his colleagues 
distinguished what they called “origins sciences” from “operation sciences.” Operation 
sciences, in their view, focus on the ongoing operation of the universe. These sciences 
describe recurring phenomena like the motions of the planets and chemical reactions that 
can be described by general laws of physics and chemistry. Origins sciences, on the other 
hand, deal with unique historical events and the causes of those events — events such as 
the origin of the universe, the formation of the Grand Canyon, and the invention of 
ancient tools and agriculture. Thaxton and his colleagues argued that inferring an 
intelligent cause was legitimate in origins science, because such sciences deal with 
singular events, and the actions of intelligent agents are usually unique occurrences. On 
the other hand, they argued that it was not legitimate to invoke intelligent causes in 
operations sciences, because such sciences only deal with regular and repeating 
phenomena. Intelligent agents don’t act in rigidly regular or lawlike ways and, therefore, 
cannot be described mathematically by laws of nature. 

Though their terminology was admittedly cumbersome, it did seem to capture an 
intuitively obvious distinction. But still I had questions. Thaxton had argued that theories 
in the operation sciences are readily testable against the repeating phenomena they 




describe. Regularity enables prediction. If a theory describing a repeating phenomenon 
was correct, then it should be able to predict future occurrences of that phenomenon at a 
specific time or under controlled laboratory conditions. Origins theories, however, do not 
make such predictions, because they deal with unique events. For this reason, Thaxton 
thought that such theories could not be tested. Theories about the past can produce 
plausible, but never decisive conclusions. As a geophysicist, I knew that earth scientists 
often fonned hypotheses about past events, but I wasn’t sure that such hypotheses were 
never testable or decisive. We have very good scientific reasons for thinking that 
dinosaurs existed before humans and that agriculture arose after the last ice age. But if 
Thaxton was right, then such conclusions about the past were merely plausible — no more 
than possibly true — and completely untestable. 

Yet I wondered if a hypothesis about the past couldn’t be tested — if there is no way to 
judge its strength or compare it against that of competing hypotheses — then why regard 
the claims of historical or “origins” theories as significant? It is provocative to claim that 
the evidence from DNA and our best scientific reasoning points strongly to an intelligent 
cause of life. It is not very interesting to claim that it is possibly true (“plausible”) that 
DNA owes its origin to such cause. Many statements are merely plausible or possibly 
true. But that doesn’t mean we have any reason to think them likely to be true. Rigorous 
scientific testing usually provides evidence-based reasons for making such claims or for 
preferring one hypothesis over another. Absent such testability, I wasn’t sure how 
significant, or scientific , Thaxton’s argument really was. 

Even so, I was deeply fascinated with the whole issue. In September 1985, 1 learned that I 
was to be laid off from my oil-company job, as the price of oil had dropped from $32 to 
$8 per barrel. I was strangely relieved. I used the rather generous severance the company 
provided to begin supporting myself as a freelance science writer. But soon after I started, 
I also learned that I had received a Rotary scholarship to study in England. The following 
spring a thin ainnail letter arrived informing me that I had been accepted to study the 
history and philosophy of science at the University of Cambridge. This course of study 
would enable me to explore many of the questions that had long fascinated me at the 
intersection of science and philosophy. It would also allow me to investigate the 
questions that had arisen in my discussions with Charles Thaxton. 

What methods do scientists use to study biological origins? Is there a distinctive method 
of historical scientific inquiry? And what does the scientific evidence tell us about the 
origin of biological information and how life began? Is it possible to make a rigorous 
scientific argument for the intelligent design of life? I eventually completed a Ph.D. 
dissertation on the topic of origin-of-life biology. In it, I was able to investigate not only 
the history of scientific ideas about the origin of life, but also questions about the 
definition of science and about how scientists study and reason about the past. 



The Current Controversy 




I couldn’t have known as I was leaving for England, but the two main questions I had 
about Dr. Thaxton’s idea — “Is it scientific?” and “How strong is the evidence for 
it?” — would resurface with a vengeance twenty years later at the center of an 
international controversy, indeed, one that would engage the attention of the mainstream 
media, the courts, the scientific establishment, and the publishing and movie industries. 
In 2005, a federal judge would rule that public-school science students in Dover, 
Pennsylvania, could not learn about the idea that life pointed to an intelligent cause, 
because the idea was neither scientific nor testable. Mainstream scientific 
organizations — such as the National Academy of Sciences and American Association for 
the Advancement of Science — would issue similar pronouncements. 

In 2006 and 2007, a spate of books with titles like The God Delusion and God Is Not 
Great would argue there is no evidence for design in biology and, therefore, no good 
evidence for the existence of God. According to Oxford evolutionary biologist Richard 
Dawkins and other New Atheists, the lack of evidence for design has made the idea of 
God tantamount to a “delusion.” In 2008, the controversy surrounding what is now 
known as the “theory of intelligent design” moved into movie theaters, video stores, and 
candidate press conferences. And this year, with the celebration of the 200th anniversary 
of Darwin’s birth and the 150th anniversary of the publication of On the Origin of 
Species, the main question that Darwin himself addressed — “Was life designed or does it 
merely appear designed?” — has reemerged as scientists, scholars, teachers, and media 
commentators evaluate his legacy. 

Yet in all of this discussion — from Dover to Dawkins to Darwin’s big anniversary — there 
has been very little discussion of DNA. And yet for me and many other scientists and 
scholars, the question of whether science has refuted the design argument or resuscitated 
it depends critically upon the central mystery of the origin of biological information. This 
book examines the many successive attempts that have been made to resolve this 
enigma — the DNA enigma — and will itself propose a solution. 
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The Evolution of a Mystery and Why It Matters 



Few schoolchildren commit to memory the name of nineteenth-century chemist Friedrich 
Wohler, nor is the waste product associated with his most famous experiment easily 
romanticized. Yet in 1828 the German scientist performed an experiment that 
revolutionized our understanding of life. 

As a professor at the Polytechnic School in Berlin, he had begun investigating substances 
that released cyanide when heated. One day he heated some ammonium cyanate, figuring 
it would release cyanide. It didn’t. The heat transfonned the ammonium cyanate crystals, 
altering both their melting point and appearance. Indeed, the resulting material, a white 
crystalline substance, possessed none of the properties typical of cyanates. What had 
happened? The new material seemed familiar somehow. Where had he encountered it 
before? At first he thought it might be an alkaloid, but he had to discard this idea after the 
mysterious substance failed to respond to tests in ways typical of alkaloids. Wohler cast 
about in his memory, rifling through his extensive learning in both chemistry and 
medicine. Then he had it. Urea! 1 Wohler dashed off a letter to fellow chemist Jons Jakob 
Berzelius: “I can no longer, as it were, hold back my chemical water; and I have to let out 
that I can make urea without needing a kidney, or even of an animal, whether of man or 
dog: the ammonium salt of cyanic acid ( cyansaures Ammoniak ) is urea.” 

The experiment, eventually replicated in laboratories around the world, showed that the 
chemical compounds in living organisms could be artificially synthesized. Though 
chemists before Wohler had synthesized naturally occurring mineral substances, many 
assumed it was impossible to synthesize compounds found in organisms, since it was 
thought that organic matter contained mysterious and immaterial “vital forces.” 4 As Sir 
Fredrick Gowland Hopkins later suggested, Wohler’s discovery marked the beginning of 
a challenge to the “primitive faith in a boundary between the organic and the inorganic 
which could never be crossed.” 5 For this reason, Wohler’s work would also exert a 
profound influence on scientific ideas about the origin of life for over a century and 
would serve as a starting point for my own investigation of the topic. 



Beginning at the Beginning 

By the time I arrived in England I was fascinated with the origin of life and wanted to 
leam everything I could about the history of scientific thinking on the subject. I also 
wanted to investigate, following my discussions with Charles Thaxton, whether scientists 
who studied origin events in the remote past used a distinctive method of scientific 
investigation, and if so, what that method of investigation entailed. 



Unfortunately, being an American untutored in the intricacies of the university system in 
the United Kingdom, I found it difficult to find information about the British academic 
programs that best fit my interests. The Rotary scholarship I had received allowed me to 
attend any one of five overseas universities, provided I could gain admittance. Several of 
them offered programs in the history or philosophy of science, but in a pre-Internet era it 
was difficult to extract detailed information from them about the specializations of their 
faculties. In the end, I set my hopes on Cambridge, since it had more of a reputation for 
science than the other universities on my list. 

When my wife, Elaine, and I arrived in the fall of 1986, parking our rental car underneath 
the imposing Gothic architecture on Trumpington Street near the center of Cambridge, I 
was more than a little intimidated. Yet within a few weeks I began to settle into my life as 
a graduate student. I soon discovered that I had made a far better choice of programs than 
I could have known while making my decision. Not only were many of the critical 
discoveries about DNA and molecular biology made in Cambridge, but it also had an 
excellent program in the history and philosophy of science that included a kindly Dutch 
scholar named Harmke Kamminga, who happened to be an expert on the history of 
scientific theories about the origin of life. 

During my first year of study — in between tutorials and lectures on everything from the 
history of molecular biology to the philosophy of physics and the sociology of science — I 
began to meet regularly with Harmke to discuss the origin of life. Under her supervision, 
I began to investigate some current theories about the origin of life, but also the early 
theories that gave rise to them. Thus, I began at the beginning — with a study of how 
origin-of-life studies first emerged as a scientific enterprise in the nineteenth century at 
the time of Darwin and his scientific contemporaries. 

I was soon confronted with an interesting historical puzzle. With the acceptance of 
Darwin’s theory of evolution, most biologists agreed that natural selection could explain 
the appearance of design in biology. For this reason, most philosophers and scientists 
have long thought that Darwin’s theory of evolution by natural selection destroyed the 
design argument. Yet I also discovered that Darwin himself admitted that his theory did 
not explain the origin of life itself. In fact, one day Peter Gautry, an archivist in the 
manuscripts room of the university library, allowed me in to read a letter by Charles 
Darwin on the subject written in 1871, twelve years after the publication of On the Origin 
of Species. The letter, handwritten on brittle paper, made clear that Darwin had little more 
than vague speculations to offer as to how the first life on earth had begun. 6 

This was consistent with what I knew. In the Origin, Darwin did not try to explain the 
origin of the first life. Instead, he sought to explain the origin of new forms of life from 
simpler preexisting forms, fonns that already possessed the ability to reproduce. His 
theory assumed rather than explained the origin of the first living thing. Since this 
limitation of Darwin’s theory was widely recognized, it raised a question: Why were 
nineteenth-and twentieth-century biologists and philosophers so sure that Darwin had 
undennined the design argument from biology? If scientists at the time had no detailed 




explanation for how life had first arisen, how did they know that design — that is, actual 
intelligent design — played no role in this critically important event? 



This chapter tells the story of what I learned as I sought to answer these questions. In the 
process, it describes some of the earliest scientific theories about the origin of life. This 
background will later prove helpful, since many contemporary theories have been 
formulated on the foundation of these earlier approaches. This chapter highlights 
something else I learned in my investigations as well. From the beginning, scientific 
theories about the origin of life have inevitably raised deeper philosophical issues not 
only about life, but also about the nature of ultimate reality. As I discuss at the close of 
the book, these philosophical issues remain with us today and are an integral part of the 
DNA enigma. 

Of course, during the late nineteenth century, scientists were not trying to explain the 
origin of biological information, let alone the infonnation stored in DNA. They did not 
know about DNA, at least not by that name, nor were they thinking about biological 
information even as a concept. But they did seek to explain how life began and were 
keenly aware of the philosophical implications of the theories they proposed. And despite 
their lack of knowledge about the inner workings of the cell, they were often oddly 
confident about the adequacy of these theories. That confidence had much to do with 
Friedrich Wohler’s “Eureka!” — or “Urea!” — moment and how scientists at the time 
viewed the nature of life. 



Setting the Philosophical Stage 

Since the time of the ancient Greeks, there have been two basic pictures of ultimate 
reality among Western intellectuals, what the Gennans call a Weltanschuung, or 
worldview. According to one worldview, mind is the primary or ultimate reality. On this 
view, material reality either issues from a preexisting mind, or it is shaped by a 
preexistent intelligence, or both. Mind, not matter, is, therefore, the prime or ultimate 
reality — the entity from which everything else comes, or at least the entity with the 
capacity to shape the material world. Plato, Aristotle, the Roman Stoics, Jewish 
philosophers such as Moses Maimonides, and Christian philosophers such as St. Thomas 
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Aquinas each held some version of this perspective. Most of the founders of modem 
science during the period historians of science call the scientific revolution (1300-1700) 
also held this mind-first view of reality. Many of these early modem scientists thought 
that their studies of nature confirmed this view by providing evidence, in Sir Isaac 
Newton’s words, of “an intelligent and powerful Being” behind it all. 8 This view of 
reality is often called idealism to indicate that ideas come first and matter comes later. 
Theism is the version of idealism that holds that God is the source of the ideas that gave 
rise to and shaped the material world. 

The opposite view holds that the physical universe or nature is the ultimate reality. In this 
view, either matter or energy (or both) are the things from which everything else comes. 
They are self-existent and do not need to be created or shaped by a mind. Natural 




interactions between simple material entities governed by natural laws eventually 
produce chemical elements from elementary particles, then complex molecules from 
simple chemical elements, then simple life from complex molecules, then more complex 
life from simpler life, and finally conscious living beings such as ourselves. In this view 
matter comes first, and conscious mind arrives on the scene much later and only then as a 
by-product of material processes and undirected evolutionary change. The Greek 
philosophers who were called atomists, such as Leucippus and Democritus, were perhaps 
the first Western thinkers to articulate something like this view in writing . 9 The 
Enlightenment philosophers Thomas Hobbes and David Hume also later espoused this 
matter-first philosophy . 10 Following the widespread acceptance of Darwin’s theory of 
evolution in the late nineteenth century, many modern scientists adopted this view. This 
worldview is called either naturalism or materialism, or sometimes scientific materialism 
or scientific naturalism, in the latter case because many of the scientists and philosophers 
who hold this perspective think that scientific evidence supports it. 

The age-old conflict between the mind-first and matter-first world-views cuts right 
through the heart of the mystery of life’s origin. Can the origin of life be explained purely 
by reference to material processes such as undirected chemical reactions or random 
collisions of molecules? Can it be explained without recourse to the activity of a 
designing intelligence? If so, then such an explanation would seem to make a 
materialistic worldview — with its claim that all of reality can be explained solely by 
undirected material processes — all the more credible. Who needs to invoke an 
unobservable designing intelligence to explain the origin of life, if observable material 
processes can produce life on their own? On the other hand, if there is something about 
life that points to the activity of a designing intelligence, then that raises other 
philosophical possibilities. Does a matter-first or a mind-first explanation best explain the 
origin of life? Either way, the origin of life was not only an intrinsically interesting 
scientific topic, but one that raised incorrigibly philosophical issues as well. For me, that 
was part of what made it interesting. 



The Mystery of the Missing Mystery 

By the close of the nineteenth century, many scientists had accepted the matter-first view. 
Whereas many of the founders of early modem science — such as Johannes Kepler, 
Robert Boyle, and Isaac Newton — had been men of deep religious conviction who 
believed that scientific evidence pointed to a rational mind behind the order and design 
they perceived in nature, many late-nineteenth-century scientists came to see the cosmos 
as an autonomous, self-existent, and self-creating system, one that required no 
transcendent cause, no external direction or design. 

Several nineteenth-century scientific theories provided support for this perspective. In 
astronomy, for example, the French mathematician Pierre Laplace offered an ingenious 
theory known as the “nebular hypothesis” to account for the origin of the solar system as 
the outcome of purely natural gravitational forces . 1 1 In geology, Charles Lyell explained 
the origin of the earth’s most dramatic topographical features — mountain ranges and 




canyons — as the result of slow, gradual, and completely naturalistic processes of change 
such as erosion or sedimentation. In physics and cosmology, a belief in the infinity of 
space and time obviated any need to consider the question of the ultimate origin of 
matter. And, in biology, Darwin’s theory of evolution by natural selection suggested that 
an undirected process could account for the origin of new fonns of life without divine 
intervention, guidance, or design. Collectively, these theories made it possible to explain 
all the salient events in natural history from before the origin of the solar system to the 
emergence of modern fonns of life solely by reference to natural processes — unaided and 
unguided by any designing mind or intelligence. Matter, in this view, had always existed 
and could — in effect — arrange itself without the help of any preexisting intelligence. 

But the origin of the first life remained a small hole in this elaborate tapestry of 
naturalistic explanation. Although Laplace’s nebular hypothesis provided additional 
support for a materialistic conception of the cosmos, it also complicated attempts to 
explain life on earth in purely material terms. Laplace’s theory suggested that earth had 
once been too hot to sustain life, since the environmental conditions needed to support 
life existed only after the planet had cooled below the boiling point of water. For this 
reason, the nebular hypothesis implied that life had not existed eternally, but instead 
appeared at a definite time in earth’s history. To scientific materialists, life might be 
regarded as an eternal given, a self-existent reality, like matter itself. But this was no 
longer a credible explanation for life on earth. There was a time when there was no life 
on earth. And then life appeared. To many scientists of a materialistic turn of mind, this 
implied that life must have evolved from some nonliving materials present on a cooling 
prebiotic earth. Yet no one had a detailed explanation for how this might have happened. 
As Darwin himself noted in 1866, “Though I expect that at some future time the [origin] 
of life will be rendered intelligible, at present it seems to me beyond the confines of 

* ?9 1 4 

science. 

The problem of the origin of life was, at this time, rendered more acute by the failure of 
“spontaneous generation,” the idea that life originates continually from the remains of 
once living matter. This theory suffered a series of setbacks during the 1860s because of 
the work of Louis Pasteur. In 1860 and 1861, Pasteur demonstrated that micro-organisms 
or germs exist in the air and can multiply under favorable conditions. 15 He showed that if 
air enters sterile vessels, contamination of the vessels with microorganisms occurs. 
Pasteur argued that the observed “spontaneous generation” of mold or bacterial colonies 
on rotting food or dead meat, for example, could be explained by the failure of 
experimenters to prevent contamination with preexisting organisms from the 
atmosphere. 16 Pasteur’s work seemed to refute the only naturalistic theory of life’s origin 
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then under experimental scrutiny. 

Despite the impasse, late-Victorian-era biologists expressed little, if any, concern about 
the absence of detailed explanations for how life had first arisen. The obvious question 
for me was, Why? From my vantage point in 1986, having just learned about the current 
impasse in contemporary origin-of-life research, the nonchalance of the Victorians 
seemed itself a bit mysterious. 




As I began investigating these questions during my first year at Cambridge, I discovered 
that these scientists actually had several reasons for holding this point of view. Even 
though many scientists knew that Darwin had not solved the origin-of-life problem, they 
were confident that the problem could be solved because they were deeply impressed by 
the results of Friedrich Wohler’s experiment. Before the nineteenth century many 
biologists had taken it as almost axiomatic that the matter out of which life was made was 
qualitatively different than the matter in nonliving chemicals. These biologists thought 
living things possessed an immaterial essence or force, an elan vital, that conferred a 
distinct and qualitatively different kind of existence upon organisms. 18 Scientists who 
held this view were called “vitalists,” a group that included many pioneering biologists. 

Since this mysterious elan vital was responsible for the distinctive properties of organic 
matter, vitalists also thought that it was impossible to change ordinary inorganic matter 
into organic matter. After all, the inorganic matter simply lacked the special 
ingredient — the immaterial right “stuff.” That’s why Wohler’s experiment was so 
revolutionary. He showed that two different types of inorganic matter could be combined 
to produce organic matter, albeit of a somewhat inglorious kind. Though some scientists 
continued to support vitalism well into the twentieth century, they had to do so on other 
grounds. 

Thus, Wohler’s experiment had a direct influence on thinking about the origin of life. If 
organic matter could be formed in the laboratory by combining two inorganic chemical 
compounds, then perhaps organic matter could have formed the same way in nature in the 
distant past. If organic chemicals could arise from inorganic chemicals, then why 
couldn’t life itself arise in the same way? After all, if vitalism was as wrong as it now 
appeared, then what is life but a combination of chemical compounds? 

Developments in other scientific disciplines reinforced this trend in thought. In the 1850s, 
a German physicist named Hennann von Helmholtz, a pioneer in the study of heat and 
energy (thermodynamics), showed that the principle of conservation of energy applied 
equally to both living and nonliving systems. The conservation of energy is the idea that 
energy is neither created nor destroyed during physical processes such as burning or 
combustion, but merely converted to other fonns. 

The chemical energy in gasoline, for example, is used by an engine to propel a car. The 
engine bums the gasoline and uses it up. But the energy contained in the gasoline is not 
destroyed; it is converted into heat (or thermal) energy, which in the cylinders is turned 
into mechanical or kinetic energy to propel the car. Helmholtz demonstrated that this 
same principle of energy conservation applied to living systems by measuring the amount 
of heat that muscle tissues generated during exercise. 19 His experiment showed that 
although muscles consume chemical energy, they also expend energy in the work they 
perform and the heat they generate. That these processes were in balance supported what 
became known as the “first law of thermodynamics” — energy is neither created nor 
destroyed. 




Even before this first law of thermodynamics had been refined, Helmholtz used a version 
of it to argue against vitalism. If living organisms are not subject to energy conservation, 
if an immaterial and immeasurable vital force can provide energy to organisms “for free,” 
then perpetual motion would be possible. But, argued Helmholtz, we know from 
observation that is impossible. Other developments supported this critique of vitalism. 
During the 1860s and 1870s scientists identified the cell as the energy converter of living 
organisms. Experiments on animal respiration established the utility of chemical analysis 
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for understanding respiration and other energetic processes in the cell. Since these new 

chemical analyses could account for all the energy the cell used in metabolism, biologists 
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increasingly thought it unnecessary to refer to vital forces. 

As new scientific discoveries undennined long-standing vitalist doctrines, they also 
bolstered the confidence of scientific materialists. German materialists, such as the 
biologist Ernst Haeckel, denied any qualitative distinction between life and nonliving 
matter: “We can no longer draw a fundamental distinction between organisms and 
anorgana [i.e., the nonliving].” In 1858, in an essay entitled “The Mechanistic 
Interpretation of Life,” another German biologist, Rudolf Virchow, challenged vitalists to 
“point out the difference between chemical and organic activity.” With vitalism in 
decline, Virchow boldly asserted his version of the materialist credo: “Everywhere there 
is mechanistic process only, with unbreakable necessity of cause and effect.” Life 
processes could now be explained by various physical or chemical mechanisms. Since, in 
our experience, mechanisms — like cogged wheels turning axles — involve material parts 
in motion and nothing more, this meant that the current function of organisms could be 
explained by reference to matter and energy alone. 

This outlook encouraged scientific materialists to assume they could easily devise 
explanations for the origin of life as well. Haeckel himself would be one of the first 
scientists to try. If life was composed solely of matter and energy, then what else besides 
matter in motion — material processes — could possibly be necessary to explain life’s 
origin? For materialists such as Haeckel, it was inevitable that scientists would succeed in 
explaining how life had arisen from simpler chemical precursors and that they would do 
so only by reference to materialistic processes. For Haeckel, finding a materialistic 
explanation for the origin of life was not just a scientific possibility; it was a 
philosophical imperative. “ 6 



Evolution on a Roll 

If the imperative for many scientists during this time was matter first, the central image 
was increasingly that of evolution, of nature unfolding in an undirected way, with the 
nebular and Darwinian hypotheses suggesting the possibility of an unbroken evolutionary 
chain up to the present. Yes, the origin of life was a missing link in that chain, but surely, 
it was thought, the gap would soon be bridged. Darwin’s theory, in particular, inspired 
many evolutionary biologists to begin fonnulating theories to solve the origin-of-life 
problem. My supervisor, Dr. Kamminga, had a memorable way of describing this 




phenomenon. She noted that the success of Darwin’s theory inspired attempts at 
“extending evolution backward” in order to explain the origin of the first life. 

Darwin’s theory inspired confidence in such efforts for several reasons. First, Darwin had 
established an important precedent. He had shown that there was a plausible means by 
which organisms could gradually produce new structures and greater complexity by a 
purely undirected material process. Why couldn’t a similar process explain the origin of 
life from preexisting chemicals? 

Darwin’s theory also implied that living species did not possess an essential and 
immutable nature. Since Aristotle, most biologists had believed that each species or type 
of organism possessed an unchanging nature or form; many believed that these forms 
reflected a prior idea in the mind of a designer. But Darwin argued that species can 
change — or “morph” — over time. Thus, his theory challenged this ancient view of life. 
Classification distinctions among species, genera, and classes did not reflect unchanging 
natures. They reflected differences in features that organisms might possess only for a 
time. They were temporary and conventional, not set in stone. If Darwin was right, then 
it was futile to maintain rigid distinctions in biology based on ideas about unchanging 
forms or natures. This reinforced the conviction that there was no impassable or 
unbridgeable divide between inanimate and animate matter. Chemicals could “morph” 
into cells, just as one species could “morph” into another. 28 

Darwin’s theory also emphasized the importance of environmental conditions on the 
development of new forms of life. If conditions arose that favored one organism or fonn 
of life over another, those conditions would affect the development of a population 
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through the mechanism of natural selection. This aspect of Darwin’s theory suggested 
that environmental conditions may have played a crucial role in making it possible for 
life to arise from inanimate chemistry. It was in this context that Darwin himself first 
speculated about the origin of life. In the 1871 letter to botanist Joseph Hooker, which I 
had seen in the Cambridge library archive, Darwin sketched out a purely naturalistic 
scenario for the origin of life. He emphasized the role of special environmental conditions 
and just the right mixture of chemical ingredients as crucial factors in making the origin 
of life possible: “It is often said that all the conditions for the first production of a living 
organism are present.... But if (and Oh! what a big if!) we could conceive in some warm 
little pond, with all sorts of ammonia and phosphoric salts, light, heat, electricity, etc., 
that a protein compound was chemically fonned ready to undergo still more complex 
changes, at the present day such matter would be instantly devoured or absorbed, which 
would not have been the case before living creatures were fonned.” Although Darwin 
conceded that his speculations ran well ahead of available evidence, the basic approach 
he outlined would seem increasingly plausible as a new theory about the nature of life 
came to prominence in the 1860s and 1870s. 



The Protoplasmic Theory of Life 




In my first year of research, I came across a statement by Russian scientist Aleksandr 
Oparin. Oparin was the twentieth century’s undisputed pioneer of origin-of-life studies, 
and his comment helped me to identify another key reason for the Victorian lack of 
concern about the origin-of-life problem. “The problem of the nature of life and the 
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problem of its origin have become inseparable,” he said. 

To explain how life originated, scientists first have to understand what life is. That 
understanding, in turn, defines what their theories of the origin of life must explain. The 
Victorians weren’t especially concerned with the origin-of-life problem because they 
thought simple life was, well, simple. They really didn’t think there was much to explain. 
Biologists during this period assumed that the origin of life could eventually be explained 
as the by-product of a few simple chemical reactions. 

Then, as now, scientists appreciated that many intricate structures in plants and animals 
appeared designed, an appearance that Darwin explained as the result of natural selection 
and random variation. But for Victorian scientists, single-celled life didn’t look 
particularly designed, most obviously because scientists at the time couldn’t see 
individual cells in any detail. Cells were viewed as “homogeneous and structure-less 
globules of protoplasm,” amorphous sacs of chemical jelly, not intricate structures 
manifesting the appearance of design. 

In the 1860s, a new theory of life encouraged this view. It was called the “protoplasmic 
theory,” and it equated vital function with a single, identifiable chemical substance called 
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protoplasm. According to this theory, the attributes of living things derived from a 
single substance located inside the walls of cells. This idea was proposed as a result of 
several scientific developments in the 1840s and 1850s. 34 In 1846, a Gennan botanist 
named Hugo von Mohl demonstrated that plant cells contained a nitrogen-rich material, 
which he called protoplasm. He also showed that plant cells need this material for 
viability. Mohl and Swiss botanist Karl Nageli later suggested that protoplasm was 
responsible for the vital function and attributes of plant cells and that the cell wall merely 
constituted an “investment lying upon the surface of the [cell] contents, secreted by the 
contents themselves.” 

This turned out to be fantastically inaccurate. The cell wall is a separate and fascinatingly 
intricate structure containing a system of gates and portals that control traffic in and out 
of the cell. Nevertheless, Mohl and Nageli’ s emphasis on the importance of the cell 
contents received support in 1850 when a biologist named Ferdinand Cohn showed that 
descriptions of protoplasm in plants matched earlier descriptions of the “sarcode” found 
in the cavities of unicellular animals. By identifying sarcode as animal-cell protoplasm, 
Cohn connected his ideas to Mohl’s. Since both plants and animals need this substance to 
stay alive, Cohn established that protoplasm was essential to all living organisms. When, 
beginning in 1857, a series of papers by scientists Franz Leybig, Heinrich Anton de Bary, 
and Max Shultze suggested that cells could exist without cellular membranes (though, in 
fact, we now know they cannot), scientists felt increasingly justified in identifying 
protoplasm as life’s essential ingredient. 38 Thus, in 1868 when the famous British 




scientist Thomas Henry Huxley declared in a much publicized address in Edinburgh that 
protoplasm constituted “the physical basis or matter of life” (emphasis in original), his 
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assertion expressed a gathering consensus. 

With the protoplasmic theory defining the chemical basis of life, it seemed plausible that 
the right chemicals, in the right environment, might combine to make the simple 
protoplasmic substance. If so, then perhaps the origin of life could be explained by 
analogy to simple processes of chemical combination, such as when hydrogen and 
oxygen join to fonn water. If water could emerge from the combination of two 
ingredients as different from water as hydrogen and oxygen, then perhaps life could 
emerge from the combination of simple chemical ingredients that by themselves bore no 
obvious similarity to living protoplasm. 



Early Theories of Life’s Origin: The Chemical Two-Step 

I discovered another reason that scientists had maintained their confidence in a 
completely materialistic account of life and the cosmos. Beginning in the late 1860s, 
scientists began to offer materialistic theories of the origin of life. And for the better part 
of the next eighty-five years or so (with the exception of one gap after the turn of the 
century), these theories kept pace with new scientific discoveries about the complexity of 
life. That is, for the most part, these new theories about how life came to be were able to 
explain what scientists were learning about what life is. 

Two scientists, Thomas Henry Huxley and Ernst Haeckel, were first to offer theories of 
how life had arisen from nonliving chemicals. Though Huxley was British and Haeckel 
Gennan, the two men had much in common intellectually. Both men rejected vitalism. 
Both men were staunch defenders of Darwin’s evolutionary approach to the origin of 
species. Both were ardent scientific materialists. And both had articulated or defended the 
protoplasmic theory of life. In these respects, Huxley and Haeckel embodied the various 
reasons for Victorian insouciance about the origin-of-life problem. Each man would 
formulate a theory of abiogenesis (life arising from nonliving matter) that reflected this 
intellectual posture. 

Huxley imagined that the origin of life had occurred by a simple two-step chemical 
process in which simple elements such as carbon, hydrogen, nitrogen, and oxygen first 
reacted to form common compounds such as water, carbonic acid, and ammonia. 40 He 
believed that these compounds then combined, under some unspecified conditions, to 
form protoplasm, the chemical essence of life. 

Meanwhile, in Germany, Haeckel 41 offered a bit more detail, though not much. He 
identified “constructive inner forces” or “formative tendencies” inherent in matter — like 
those we find in inorganic crystal fonnation — as the cause of the self-development of life. 
He asserted that the causes that produce form are the same in both inorganic crystals and 
living organisms. 4 ” Thus, for Haeckel, the origin of life could be explained by the 
spontaneous crystallization of “formless lumps of protein” from simpler carbon 




compounds. 43 Haeckel believed that, once formed, the first one-celled organisms, which 
he called Monera, would have gradually attained the relatively simple structure he 
assumed them to possess as they assimilated new material from the environment. Then, 
due to their semifluid constitution, these primitive cells would continue to rearrange 
themselves internally over time. 44 Even so, he clearly regarded the essential step in the 
process of abiogenesis as complete after the spontaneous crystallization of the 
“homogeneous and structure-less globules of protoplasm.” 45 

Huxley also viewed the nature of life as scarcely distinguishable from inorganic crystals. 
Many other biologists adopted similar views. Eduard Pfluger, Karl Wilhelm von Nageli, 
August Weismann, and Oscar Loew each attributed the essential properties of life to a 
single chemical entity rather than to complex processes involving many interrelated 
parts. 46 Pfluger, for example, thought the presence of carbon and nitrogen (in the form of 
the cyanogen radical, -CN) distinguished “living” proteins from “dead” ones. 47 By 
equating the essence of life with a single chemical unit such as “living proteins” 
(Pfluger), “active proteins” (Loew), “biophors” (Weismann), “probionts” (Nageli), or 
“homogeneous protoplasm” (Haeckel and Huxley), scientists during the 1870s and 1880s 
made it easy to explain the origin of life. Yet only as long as their simplistic conceptions 
of the nature of life held sway did their equally simplistic models of the origin of life 
seem credible. 

Over the next sixty years biologists and biochemists gradually revised their view of the 
nature of life. During the 1890s, scientists began to leam about enzymes and other types 
of proteins. Before 1894, scientists had only observed enzymes catalyzing reactions 
outside the cell. 48 With the advance of laboratory techniques allowing scientists to gather 
evidence of the activity of enzymes within cells, and with the discovery of enzymes 
responsible for such metabolic reactions as oxidation, fennentation, and fat and protein 
synthesis, a new theory called the “enzymatic theory” displaced the protoplasmic theory 
of life. 49 By the turn of the century, most biologists came to see the cell as a highly 
complex system of integrated chemical reactions, not at all the sort of thing that could be 
adequately explained by vague references to processes of crystallization. For a time, the 
growing awareness of this chemical complexity impeded attempts to explain the origin of 
life. But by the 1920s and 1930s, a pioneering Russian scientist formulated a new theory 
to keep pace with this growing scientific awareness of the cell’s complexity. 



Oparin to the Rescue 

A new theory of evolutionary abiogenesis that envisioned a multibillion-year process of 
transformation from simple chemicals to a complex metabolic system 50 was proposed to 
the Russian Botanical Society in May 1922 by the young Soviet biochemist Aleksandr I. 
Oparin (1894-1980). 51 Oparin first published his theory in Russian in 1924 and then 
refined and developed it, publishing it again in English in 1938. Both books were simply 
called The Origin of Life. 




Oparin’s interest in the origin of life was first awakened after hearing lectures on 
Darwinism from the plant physiologist Kliment Arkadievich Timiriazev, who was 
himself a staunch Darwinian. “According to Oparin,” writes historian of science Loren 
Graham, “Timiriazev described Darwinian evolution and revolutionary political thought 
as being so intimately connected that they amounted to the same thing. In this view 
Darwinism was materialistic, it called for change in all spheres, it was atheistic, it was 

CD 

politically radical, and it was causing a transformation of thought and politics.” 




Figure 2.1. Aleksandr Oparin (1894-1980), pioneering chemical evolutionary theorist. 
Courtesy of Novosti/Photo Researchers, Inc. 



Oparin was a fascinating figure from a fascinating time. He published his first theory of 
the origin of life just five years after the Bolshevik Revolution while living in Moscow, 
where Marxist slogans and thinking were popular, especially in intellectual circles. 53 It 
seemed a little odd to me at first that anyone could think about something as seemingly 
remote as the origin of the first life while such cataclysmic changes were taking place in 
society, but I discovered many of the early Marxists were quite interested in the subject 
of biological origins. Marx himself had corresponded with Darwin, and he thought that 
Darwin’s theory of evolution put his own theory about how societies evolved on a firm 
materialistic and scientific footing. 54 





Friedrich Engels, Marx’s intellectual collaborator, actually wrote an essay on the origin 
of the first life. 55 Like Marx, he was convinced that major societal changes took place in 
sudden spurts in response to changes in the material conditions of life and society. He 
wanted to show that a similar “revolution” had taken place to produce life, so that he 
could demonstrate the plausibility of Marxist doctrine. A key Marxist idea was that a 
small quantitative increase in the intensity of some condition or situation could suddenly 
produce a qualitative or revolutionary change. Dissatisfaction and alienation with the 
capitalist system among workers, for example, might gradually increase over time, but it 
would eventually grow to a point where a revolutionary change would suddenly occur, 
ushering in a completely new way of ordering society. Engels thought he could illustrate 
this key Marxist concept if he showed that a quantitative increase in the complexity of a 
system of chemicals could suddenly produce a qualitative (i.e., revolutionary) change in 
that system, resulting in the first life. 56 

Was Oparin influenced or motivated by such specifically Marxist ideas? In addition to 
Timiriazev, whose politics Oparin described as “very progressive” and Leninist, Oparin 
was also closely associated with an older Marxist biochemist and former revolutionary, 
A. N. Bakh, after 1920. Even so, it’s not clear how much Marxism per se influenced 
Oparin’s thinking about the origin of life. It is clear, however, that Oparin rejected all 
forms of idealism. Instead, he embraced a materialistic view of reality. Accordingly, he 
saw that the origin-of-life problem needed to be solved within a materialistic framework 
of thought. 58 

At the same time, Oparin thought there were a number of scientific reasons for supposing 
that the origin of life could be explained by reference to purely chemical processes. First 
of all, there was Wohler’s famous synthesis of urea, which showed that both living and 
nonliving matter share a common chemical basis. It was clear from Oparin’s writing that, 
a hundred years after Wohler’s experiment, it continued to have a profound influence on 
thinking about the nature and origin of life. To Oparin, Wohler’s experiment established 
that “there is nothing peculiar or mysterious” about the processes at work in a living cell 
“that cannot be explained in tenns of the general laws of physics and chemistry.” Oparin 
also noted that several nonliving materials, not just urea, manifest attributes like those 
once thought to characterize only living organisms. 59 For example, carbon, the element 
common to all living protoplasm and organisms, also occurs naturally in inanimate 
minerals such as graphite, diamond, marble, and potash. Besides, argued Oparin, like 
living organisms, many inorganic materials display chemical organization and structure. 
Inanimate materials like crystals and magnets have a very definite and orderly 
organization. Crystals even reproduce themselves, though not in the same way cells do. 
Although Oparin admitted inanimate materials like crystals do not have the kind of 
“complicated order” observed in living cells, the similarities he did identify between life 
and nonlife made him optimistic that scientists could explain the origin of life by 
reference to ordinary chemical processes. 60 

Even so, given the complexity of the chemical reactions going on inside the cell, Oparin 
thought any return to spontaneous generation was untenable. As he stated, “The idea that 




such a complicated structure with a completely determinate fine organization could arise 
spontaneously in the course of a few hours... is as wild as the idea that frogs could be 
formed from the May dew or mice from corn .” 61 Instead, in his view, biological 
organization must have evolved gradually from simpler chemistry over a long period of 
time . 62 



Oparin Sets the Stage 

Oparin’s theory envisioned many discrete events along the way to the development of 
life. Nevertheless, his theory described processes that can be divided into two basic 
stages. The first part of his theory described how the chemical building blocks of life 
arose from much simpler chemicals in the earth’s atmosphere and oceans. The second 
stage tells how the first organism arose from these molecular building blocks. Let’s look 
at the first part of Oparin’s scenario first. 
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Figure 2.2. Oparin’s 1936 early earth origin-of-life scenario. 
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Oparin thought that the early earth had a core made of heavy metals. As the early earth 
cooled after its initial formation, he postulated, its core would have contracted, exposing 
cracks and fissures in the surface of the earth. Heavy metals from the core would have 
then combined with carbon fonning compounds called iron carbides. These compounds 
were squeezed to the surface of the earth like toothpaste through a tube (see Fig. 2.2). 

After arriving at the surface these carbide compounds would have begun to react with the 
atmosphere. By 1936, Oparin had come to think that the early earth’s atmosphere was 



devoid of free oxygen. Instead, he envisioned an early atmosphere containing a noxious 
mixture of energy-rich gases such as ammonia (NH 3 ), dicarbon (C 2 ), cyanogen (CN), 
steam, and simple hydrocarbons like methene (CH) and methylene (CH 2 ). He then 
envisioned these simple hydrogen-rich molecules in the atmosphere reacting with the iron 
carbides arriving at the surface of the earth. This would have resulted in the formation of 
heavy energy-rich hydrocarbons, the first organic molecules . 64 

The compounds 65 produced in this way would have then reacted with ammonia (NH 3 ) in 
the atmosphere to fonn various nitrogen-rich compounds . 66 This was a significant step, 
because Oparin knew that the amino acids out of which protein molecules are made are 
themselves rich in nitrogen. Oparin also thought that energy-rich hydrocarbon derivatives 
in water could participate in every type of chemical change occurring in the cell, 
including polymerization. This was important because polymerization is the kind of 
reaction by which amino acids link up to form proteins. Thus, Oparin suggested that these 
hydrocarbon derivatives reacted with one another and with other chemicals in the oceans 
to produce amino acids, which then linked together to form proteins. 



Oparin’s Account of the First Organisms 

The second stage of Oparin’s scenario used specifically Darwinian evolutionary concepts 
to explain the transformation of organic molecules into living things. In particular, he 
suggested that competition for survival arose between little enclosures of protein 
molecules. This competition eventually produced primitive cells with all kinds of 
complex chemical reactions going on inside them. But before he could describe how 
competition between protocells produced life, he had to find a chemical structure that 
could function as a primitive cell, or at least as a primitive cell membrane. He needed an 
inanimate structure that could enclose proteins and separate them from the environment. 

He found what he was looking for in the work of an obscure Dutch chemist named H. G. 
Bungenberg de Jong. In 1932, Bungenberg de Jong described a structure called a 
“coacervate,” (from the Latin coacervare, meaning “to cluster”). A coacervate is a little 
cluster of fat molecules that clump together into a spherical structure because of the way 
they repel water. (See Fig. 2.3.) Because these fat molecules, or lipids, have a 
water-repelling side and a water-attracting side, they will fonn a structure that both repels 
water on the outside and encloses water on the inside. Thus, these coacervates define a 
distinct boundary with the surrounding environment. They even allow organic molecules 
to pass in and out of the coacervate cluster, thus simulating the function of a cell 
membrane. 

Oparin suggested that biologically significant molecules like carbohydrates and proteins 
could have been enclosed in such structures in the prebiotic ocean. As these molecules 
began to react with one another inside the coacervate clumps, they developed a kind of 
primitive metabolism. For this reason, Oparin regarded them as intermediate structures 
between animate and inanimate chemistry: “With certain reservations we can even 




consider the first piece of organic slime which came into being on the earth as being the 
first organism.” 67 




Figure 2.3. Two-dimensional cross section of a coacervates (left) and three-dimensional 
cutaway of half of a coacervate (right). 



Oparin proposed that the attributes of these coacervate clumps would enable the complex 
biochemical organization that now characterizes living cells to arise gradually through a 
process of natural selection. As some coacervates grew, they would develop increasingly 
efficient means for assimilating new substances from the environment, causing their 
growth rates to increase. Those that failed to develop efficient means of assimilating 
essential nutrients would languish. The good “eaters” thrived, while the poor “eaters” did 
not. As the relative abundance of nutrients in the environment changed, conditions arose 
that favored more highly organized organic bodies. Less efficient protocells would soon 
exhaust their stored potential energy and decompose as their supplies of nutrients 
diminished. But those primitive organisms that had (by chance) developed crude forms of 
metabolism would continue to develop. A Darwinian-style competition developed, 
eventually resulting in the first living cell. 68 (See Fig. 2.4.) 

So Aleksandr Oparin explained the origin of life using Darwinian principles. He showed 
how complex structures could arise gradually from simpler ones after environmental 
changes had occurred that favored the complex structures in their competition for 
survival. 



The Miller-Urey Experiment 

Oparin’s theory stimulated considerable scientific activity following the English 
publication of his book; several scientists during the 1940s and early 1950s developed 
and refined Oparin’s scenario in pursuit of a more detailed theory of chemical evolution. 
Perhaps the most significant attempts to advance Oparin’s research program came in the 
form of laboratory experiments, including several attempts to simulate an important step 
in his historical narrative, the production of biological building blocks from simpler 



atmospheric gases. The most famous, immortalized in high-school biology textbooks 
around the world, is the Miller-Urey experiment. 

In December 1952, while doing graduate work under Harold Urey at the University of 
Chicago, Stanley Miller conducted the first experimental test of the Oparin-Haldane 
chemical evolutionary model. Using boiling water, Miller circulated a gaseous mixture of 
methane, ammonia, water, and hydrogen through a glass vessel containing an electrical 
discharge chamber. 69 Miller sent a high-voltage charge into the chamber via tungsten 
filaments to simulate the effects of lighting on prebiotic atmospheric gases. (See Fig. 
2.5.) After two days, Miller found amino acids in the U-shaped water trap he used to 
collect reaction products at the bottom of the vessel. Using a technique for analyzing 
mixtures called paper chromatography, he identified the amino acids glycine, 
alpha-alanine, and beta-alanine. 
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Figure 2.4. Simplified schematic of Oparin’s 1936 chemical evolutionary scenario 
showing the main steps from simple chemicals to a more complex living cell. 
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Figure 2.5. The Miller-Urey experiment. 



Miller’s success in producing these protein building blocks was heralded as a 
breakthrough and as powerful experimental support for Oparin’s theoretical work. After 
the publication of his findings in 1953, others replicated Miller’s results, and soon hopes 
were running high that a comprehensive theory of life’s origin was within reach. Miller’s 



experiment received widespread coverage in popular publications such as Time magazine 
and gave chemical evolutionary theory the status of textbook orthodoxy almost 
overnight. As science writer William Day reflected, “It was an experiment that broke 
the logjam. The simplicity of the experiment, the high yields of the products and the 
specific biological compounds. . .produced by the reaction were enough to show the first 
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step in the origin of life was not a chance event, but was inevitable.” 

By the Darwinian centennial in 1959, spirits were running high. The final holdout in the 
grand materialistic story of life, earth, and the cosmos seemed at last to be falling into 
line. With the modern “neo-Darwinian” version of Darwin’s theory firmly established 
and the emergence of an experimentally validated theory of chemical evolution, 
evolutionary science had now provided a comprehensive and fully naturalistic 
explanation for every appearance of design in the living world from the humble 
single-celled bacterium to the most intricate neural structure of the human brain. A 
seamless and fully naturalistic account of the origin and development of life-forms 
appeared, if not complete, then at least sketched in enough detail to preclude 
anachronistic speculations about a designing hand. The problem of the origin of life had 
at last been solved. Or at least so it seemed, until scientists began to reflect more deeply 
on the other great discovery of 1953. 




3 



The Double Helix 



The in formation revolution in biology officially began in 1953 with the elucidation of the 
structure of the DNA molecule. Yet the scientists who eventually ignited this revolution 
were a seemingly unimpressive pair. Two unknowns in the developing field of 
biochemistry, James Watson and Francis Crick possessed no firsthand experimental data 
and a very limited knowledge of the relevant chemistry. Crick hadn’t even finished his 
Ph.D., and the degree he did have was in physics. 1 

There were three teams in the running to unlock the mystery of the structure of DNA, 
which by now most biologists assumed would help explain how hereditary traits are 
passed from one generation to another. The obvious front-runner was Nobel laureate 
Linus Pauling. Next was an imposing team of Cambridge scientists headed by Lawrence 
Bragg and Max Perutz. The third was a team with the most sophisticated imaging 
equipment in the world, headed by Maurice Wilkins at King’s College, University of 
London. As for Watson and Crick, no scientist at the time would have even put them in 
the race. By most appearances, they were mere hangers-on at the Cavendish lab in 
Cambridge, a couple of young men lurking around pilfering other people’s data, out of 
their depth and out of the race. 

While the London team leader, Maurice Wilkins, and its X-ray specialist, Rosalind 
Franklin, did the painstaking work of collecting the hard data on the mystery molecule, it 
looked as if Watson and Crick did little more than play with toy models. One they 
presented was greeted with laughter by Rosalind Franklin. Drawing on her extensive 
knowledge of the X-ray images that she had generated of crystals made of DNA material, 
she quickly convinced Watson, Crick, and everyone else in the room that their toy model 
was far wide of the mark. 

Watson, with his wild hair and perfect willingness to throw off work for a Hedy Lamarr 
film, and Crick, a dapper and no longer especially young fellow who couldn’t seem to 
close the deal on his dissertation — who were these guys? They even got their lab space 
stripped away at one point. Eventually they got it back, but a peek into the Cavendish lab 
months later would have done little to inspire confidence. Crick hadn’t arrived yet, and 
there was wild-haired Watson at the table tinkering around with cardboard cutouts — a far 
cry from the sophisticated technology on display at the King’s lab. 

But it was in the end Watson and Crick who sparked a revolution. The molecular 
biological revolution, as it came to be called, would redefine our understanding of the 
nature of life by highlighting the importance of information to the inner workings of 



living things. This revolution would also redefine the questions that scientists 
investigating the origin of life would, from that time forward, have to answer. 



Of Natures and Origins 

During my Ph.D. studies, I learned that scientists investigating the past often reason much 
like detectives in a whodunit. Detectives consider a number of suspects in order to 
detennine the culprit as they try to reconstruct the scene of the crime. In a similar way, 
historical scientists — such as geologists, archaeologists, paleontologists, cosmologists, 
and evolutionary biologists — weigh the merits of competing explanations as they try to 
figure out what caused a particular event in the past to occur or what caused a given 
structure or piece of evidence to arise. In doing so, historical scientists use a scientific 
method called the “method of multiple working hypotheses.” But before scientists can 
evaluate competing ideas about the cause of a given event or structure, they must have a 
clear understanding of what it is that needs to be explained. 

For scientists trying to explain the origin of life, one of the most important clues we have 
is life itself — its structure, function, and composition. That’s why Aleksandr Oparin, the 
first scientist to propose a comprehensive scientific theory of the origin of life, said, “The 
problem of the nature of life and the problem of its origin have become inseparable.” 
Harmke Kamminga puts it this way: “At the heart of the problem of the origin of life lies 
a fundamental question: What is it exactly that we are trying to explain the origin of?” 4 

Watson and Crick’s discovery, and those that soon followed in its wake, revolutionized 
our understanding of the nature of life. These discoveries also defined the features of life 
that scientists are now “trying to explain the origin of.” This chapter tells the story of the 
discovery that inaugurated this revolution in biological understanding — the story of the 
double helix. This historical background will prove indispensable in later chapters. In 
order to evaluate competing ideas about the origin of life and biological infonnation, it is 
important to know what DNA is, what it does, and how its shape and structure allow it to 
store digital infonnation. As I show in subsequent chapters, some recent theories of the 
origin of life have failed precisely because they have failed to account for what scientists 
have discovered over the last century about the chemical structure of DNA and the nature 
of biological information. 



The Mystery of Heredity 

From ancient times, humans have known a few basic facts about living things. The first is 
that all life comes from life. Omne vivum ex vivo. The second is that when living things 
reproduce themselves, the resulting offspring resemble their parents. Like produces like. 
But what inside a living thing ensures that its offspring will resemble itself? Where does 
the capacity to reproduce reside? 

This was one of the long-standing mysteries of biology, and many explanations have 
been proposed over the centuries. One theory proposed that animals contained miniature 




replicas of themselves stored in the reproductive organs of males. Another theory, called 
pangenesis, held that every tissue or organ of the body sent parts of itself — called 
gemmules — to the reproductive organs to influence what was passed on to the next 
generation. But by the mid-nineteenth century the target began to narrow as scientists 
increasingly focused on the small spherical enclosures called cells, only recently within 
reach of the best microscopes of the day. In 1839, Matthias Schleiden and Theodor 
Schwann proposed the “cell theory,” which asserted that cells are the smallest and most 
fundamental unit of life. In the wake of their proposal, biologists increasingly focused 
their search for the secret of heredity on these seemingly magical little entities and their 
critical contents. But through much of the rest of the nineteenth century, the structure of 
cells was a complete mystery, which is why prominent scientists like Ernst Haeckel could 
describe the cell as “homogeneous and structure-less globules of protoplasm.” 5 

Meanwhile, however, scientists began to notice that the transmission of hereditary 
traits — wherever the capacity for producing these traits might be stored — seemed to 
occur in accord with some predictable patterns. The work of Gregor Mendel in the 1860s 
was particularly important in this regard. Mendel studied the humble garden pea. He 
knew that some pea plants have green seeds, while some have yellow. When he crossed 
green peas with yellow peas, the second-generation plants always had yellow peas. If 
Mendel had stopped there, he might have assumed that the capacity for making green 
seeds in the next generation had been lost. But Mendel didn’t stop there. He crossed the 
crosses. Each of these parent plants had yellow seeds, but their offspring had 75 percent 
yellow seeds and 25 percent green. Apparently the first generation of crossed seeds, the 
all-yellow batch, nevertheless had something for making “green” seeds tucked away 
inside of them, waiting to emerge in a subsequent generation, given the right 
circumstances. 6 

Mendel called the yellow trait “dominant” and the green trait “recessive.” The latter 
might disappear in a given generation, but it hadn’t dropped out of existence. It was 
stored inside the seed in the fonn of some sort of signal, memory, or latent capacity, 
waiting to express itself in a future generation. Mendel showed that the entity or factor 
responsible for producing a trait (which was later called a “gene”) has some kind of 
existence of its own independent of whether the trait is seen in an individual plant. 

Mendel’s discovery raised an obvious question: Where and how was this hereditary 
memory or signal being stored? Beginning with experiments done in the years after the 
Civil War, biologists began to focus on the cell nucleus. In 1869, Friedrich Miescher, the 
son of a Swiss physician, discovered what would later be called DNA. Miescher was 
interested in the chemistry of white blood cells. To find such cells, he collected pus from 
postoperative bandages. He then added hydrochloric acid to the pus, dissolving all the 
material in the cell except the nuclei. After that he added alkali and then acid to the 
nuclei. Miescher called the gray organic material that fonned from this procedure 
“nuclein,” since it was derived from the nucleus of the cell. Other scientists using staining 
techniques soon isolated banded structures from the nucleus. These came to be called 
“chromatin” (the material we now know as chromosomes ) because of the bright color 




they displayed once stained. When it was later shown that chromatin bands and 
Miescher’s nuclein reacted to acid and alkali in the same way, scientists concluded that 
nuclein and chromatin were the same material. When biologists observed that an equal 
number of chromatin strands combine when an egg and spenn fuse into a single nucleus, 
many concluded that chromatin was responsible for heredity. 

To make further progress toward a solution to the mystery of heredity, geneticists needed 
to study these chromatin bands more closely. In 1902 and 1903, Walter Sutton published 
two papers suggesting a connection between the laws of Mendelian genetics and 
chromosomes. 8 Sutton suggested that Mendel’s laws could be explained by observing 
chromosomes during reproduction. Since offspring receive an equal number of 
chromosomes from each parent, it was possible that they were receiving the capacity for 
different characteristics — Mendel’s traits — from separate maternal and paternal 
chromosomes. Since traits often occurred in pairs, and chromosomes occurred in pairs, 
perhaps the capacity for producing these traits was carried on chromosomes. 

Some scientists thought that this idea could be tested by altering the composition of the 
chromatin bands to see what effect various changes would have on the creatures that 
possessed them. What was needed was a creature that reproduced quickly, possessed a 
relatively simple set of features, and could be bathed in change-producing or 
“mutation-inducing” radiation without raising ethical concerns. Fruit flies were the 
perfect choice. They had a fourteen-day life cycle and only four pairs of chromosomes, 
and nobody was likely to start picketing on their behalf. 

Beginning in 1909 at Columbia University, Thomas Hunt Morgan undertook experiments 
with large populations of fruit flies, subjecting them to a variety of mutagens (i.e., 
substances that cause mutations), increasing their mutation rate manyfold. Then he bred 
them, steadily assembling a mountain of data about the resulting mutations and how often 
they were passed from one generation to the next. He encountered all of the fruit-fly 
mutations found in natural populations, such as “bithorax” and “white eye,” along with 
new ones, such as “vestigial wing” (see Fig. 3.1). After studying many generations, 
Morgan found that some of these traits were more likely to occur in association. 
Specifically, he noticed four linkage groups, suggesting that information-bearing entities 
responsible for passing along these mutations were located physically next to each other 
on the chromosome. Morgan devised a number of experiments to show that genes have a 
definite, linear order on the chromosome. 9 

By 1909, scientists had been able to separate an acidic material from other proteinaceous 
material in the chromatin bands. Chemists soon determined the chemical composition of 
this acidic material. They called it a “nucleic acid,” because it had come from the 
nucleus. They called it a “deoxyribose nucleic acid,” because they were able to identify a 
deoxygenated sugar molecule called ribose in the molecule (see Fig. 3.2, comparing the 
structure of deoxyribose sugar and ribose sugar). Scientists also determined that the 
molecule was made of phosphates and four bases, called adenine, cytosine, guanine, and 
thymine, the chemical fonnulas and structures of which had been known for a while. 




(Figure 3.3 shows the formulas and structures of each of the chemical parts of 
deoxyribonucleic acid, or DNA.) 
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Figure 3.1. A normal fruit fly and three mutant fruit flies of the kind studied by Thomas 
Morgan. 



Figure 3.2. Structure of ribose sugar {left) and deoxyribose sugar {right). 

Science historians often describe the process that leads to a great discovery as “putting 
pieces of a puzzle together.” In the case of DNA, that metaphor is unusually apt. By 
1909, the composition and structure of the chemical parts of DNA were mostly known. 
But the structure of the whole molecule was not. Further progress in the search for the 
secret of hereditary information required scientists to piece together the constituent parts 
of the molecule in various different ways in search of a solution to this puzzle. When the 
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pieces locked in place properly — in accord with all that was known about the dimensions, 
shapes, and bonding proclivities of the constituent parts — a solution to the puzzle would 
be obvious. Everything would fit. But in 1909, scientists were far from understanding 
how all the pieces of the DNA molecule fit together. In fact, for years, many showed little 
interest in detennining the structure of DNA, because they did not think that DNA had 
anything to do with heredity. 

Many scientists overlooked DNA because they were convinced that proteins played the 
crucial role in the transmission of hereditary traits. They favored proteins over DNA 
primarily due to a misunderstanding about the chemical structure of DNA. By the early 
part of the twentieth century, scientists knew that in addition to containing sugars and 
phosphates, nucleic acid was composed of the four bases adenine, thymine, cytosine, and 
guanine, but in 1909 chemist P. A. Levene incorrectly reported that these four nucleotide 
bases always occurred in equal quantities within the DNA molecule. 10 To account for 
this putative fact, he formulated what he called the “tetranucleotide hypothesis.” 
According to this hypothesis the four nucleotide bases in DNA linked together in the 
same repeating order to form a sequence such as 
ATCGATCGATCGATCGATCGATCG. 
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Figure 3.3. The structural formulas of each of the chemical parts of DNA (as well as the 
nucleotide base uracil, which is present in RNA). RNA consists of phosphates; the bases 
uracil, cytosine, guanine, and adenine; and ribose sugar (see Fig. 3.2). 



Levene’s model threw many scientists off the scent, but for understandable reasons. For 
DNA to be the material responsible for producing hereditary traits, it had to have some 
feature that could account for, or produce, the great variety of physiological traits found 
in living organisms. Even the humble fruit flies that Morgan used in his mutation studies 
had many different features — different kinds of eyes, legs, wings, bristles, and body 
proportions. If the capacity for building these structures and traits was something like a 
signal, then a molecule that simply repeated the same signal (e.g., ATCG) over and over 
again could not get the job done. At best, such a molecule could produce only one trait. 
Instead, scientists knew that they needed to discover some source of variable or irregular 
specificity, a source of information, within the heritable material (or germ line) of 



organisms to account for the many different features present in living things. Since the 
sequence of bases in DNA was, according to Levene, rigidly repetitive and invariant, 
DNA’s potential seemed inherently limited in this regard. 

That view began to change in the mid- 1940s for several reasons. First, a scientist named 
Oswald Avery successfully identified DNA as the key factor in accounting for heritable 
differences between different bacterial strains. 1 1 

When Avery was working at the Rockefeller Institute in New York he became intrigued 
by an experiment on Pneumococci bacteria performed by Frederick Griffith. The 
experiment moved from the unsurprising to the surprising. If a deadly strain of the 
bacteria was first heated to death, the strain was harmless when injected into mice. No 
surprise there. The mice were also unhanned when injected with a living but nonvirulent 
strain of the virus. No surprise there either. But then Griffith injected mice with both the 
lethal strain of bacteria that had been heated to death and the living but harmless strain of 
bacteria. The mice died. This was surprising. One would expect the mice to be 
unaffected, since both fonns of bacteria had proven totally harmless before. Injected with 
either of the two strains separately, the mice lived. But when the strains were injected 
together, the mice died as if the dead bacteria had become suddenly lethal again. 1 " (See 
Fig. 3.4.) 

It was almost too strange to believe. It was like those old zombie movies, where the 
walking dead attack and convert nice ordinary people into killing machines. Avery 
wanted to get to the bottom of this strange phenomenon. His laboratory started by taking 
the mice out of the equation. The scientists prepared a rich medium for the bacteria, then 
placed the two strains of bacteria — the living harmless Pneumococci bacteria along with 
the once lethal but now dead strain — into direct contact with each other in the rich 
medium. After several life cycles, Avery was able to begin detecting living versions of 
the lethal, but previously dead strain. 

There were two possibilities. Either the dead strain was coming back to life — but this was 
absurd — or something in the dead strain of bacteria was being transferred to the living 
strain, making it suddenly lethal. With the complicating variable of the mice out of the 
picture, the way was clear for Avery to locate the culprit; he set out to isolate the material 
responsible for this surprising transformation. In 1944, Avery and two of his colleagues, 
Colin MacLeod and Maclyn McCarty, published their findings in the Journal of 
Experimental Medicine. What was the transforming agent? To everyone’s surprise, 
including Avery’s, it appeared to be Levene’s seemingly uninteresting nucleic acid, 
DNA. DNA from the dead strain was being transferred to the living strain, rendering the 
once harmless living strain suddenly lethal. 33 
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Figure 3.4. Frederick Griffith’s injected mice experiment. 



When Erwin Chargaff, of Columbia University, read Avery’s paper, he immediately 
sensed its importance. Fie saw “in dark contours the beginning of a grammar of biology,” 
he recounted. “Avery gave us the first text of a new language or rather he showed us 
where to look for it. I resolved to search for this text.” 14 

Chargaff s experimental work eventually provided additional evidence that DNA could 
be the source of biological variability. It also provided an important clue about the 
structure of the DNA molecule. Chargaff purified samples of DNA and then separated its 
chemical constituents, the sugars, the phosphates, and the four bases. Using techniques of 
quantitative chemical analysis, he determined the relative proportions of each of these 
constituents and, in the process, discovered a puzzling regularity. The quantity of adenine 
always equaled the quantity of thymine, while the quantity of the guanine always equaled 
the quantity of cytosine. 15 



This discovery was puzzling, in part because Chargaff also discovered a surprising 
irregularity — one that contradicted the earlier “tetranucleotide hypothesis” of Levene. 
Chargaff discovered that individual nucleotide frequencies actually differ between 
species, even if they often hold constant within the same species or within the same 
organs or tissues of a single organism. 16 More important, Chargaff recognized that even 
for nucleic acids with the same proportion of the four bases (A, T, C, and G), “enormous” 
numbers of variations in sequence were possible. As he put it, different DNA molecules 
or parts of DNA molecules might “differ from each other... in the sequence, [though] not 
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the proportion, of their constituents.” 

In other words, a strand of DNA might be like a strand of binary computer code. The 
information-rich string of zeroes and ones will have a completely irregular and 
nonrepetitive sequence of the two characters, but given a fairly long string of the binary 
code, one can expect to find very close to the same number of zeroes as ones. Thus 
Chargaff argued that, contrary to the tetranucleotide hypothesis, base sequencing in DNA 
might well display the high degree of variability and irregularity required by any 
potential carrier of heredity. 18 And so, by the late 1940s many had begun to suspect that 
DNA was a good candidate for transmitting hereditary information. But still, no one 
knew how it did this. 



Unlikely Heroes 

James Watson crossed an ocean to find out. As a twenty-three-year-old who had once 
spent much of his spare time bird watching, he did not appear, on casual inspection, to 
possess either the experience or the fire to solve the mystery. Yet there was more to the 
seemingly easygoing young man than met the eye. Watson was a former Chicago Quiz 
Kid game-show prodigy who had entered the University of Chicago at the tender age of 
fifteen. By nineteen, he had finished his bachelor’s degree in biology. By twenty-two, he 
had earned a Ph.D. from the University of Indiana under Salvador Luria, an expert on the 
genetics of viruses. Watson’s doctoral studies focused on viral genetics, but along the 
way he learned a fair amount of biochemistry and radiation genetics. At one point, he 
took a course from Hermann J. Muller, of fruit fly fame. Though he “aced” the course, 
Watson concluded that the best days for gene-radiation studies were past. New methods 
were needed to get DNA to divulge its secrets. 19 

After he graduated, Watson was almost constantly ruminating about biology and keeping 
his ears open for any word about new ways to study DNA. He traveled to Copenhagen 
for postdoctoral research. While there, he performed experiments alongside Danish 
scientist Ole Maaloe that strengthened his growing sense that DNA, not protein, was the 
carrier of genetic information. Then in the spring of 1951, at a conference in Naples on 
X-ray crystallography, he met Maurice Wilkins, the head of the lab at King’s College, 
London. From their conversations, Watson was seized with the idea of moving to 
Cambridge, England, where various experts were gathering to discover the secret of 
heredity 




He landed a position with the Cavendish Laboratory in 1951 under the Austrian Max 
Perutz and Englishman William Lawrence Bragg, both distinguished experts in the use of 
X rays to study large biological molecules. Watson quickly formed a partnership with 
Francis Crick, a theoretical physicist who knew precious little chemistry, but who had 
used advanced mathematics to develop theoretical insights about how to study the 
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structure of proteins by using X rays. In Crick, Watson found a partner who shared his 
thinking about DNA. Both men were interested in genetics, but both thought that a deeper 
understanding of heredity would emerge only after scientists understood “what the genes 
were and what they did.” For Watson, at least, that meant understanding the structure 
of DNA. And he soon convinced Crick that cracking this problem would make it possible 
to understand the transmission of genetic information. Crick’s expertise in understanding 
protein structure and X-ray imaging techniques would come in handy, as would his knack 
for taking insights from disparate fields and finding significant patterns in them that 
other, more specialized scientists missed. 

Watson and Crick also possessed an important quality rarely appreciated in scientists, but 
vital to those attempting to make discoveries or challenge an outmoded framework of 
thought. The two men were perfectly willing to ask questions exposing their own 
ignorance, to embarrass themselves, if necessary, in pursuit of answers. They had no 
world-class reputations to risk and little to lose as they pursued answers unfettered by 
concerns about their own respectability. Nor could they be shamed into pouring their 
energies into gathering original data when what was needed was new thinking. They 
would leave data gathering to others while they focused on the big picture, constantly 
reassembling the pieces of a growing puzzle in search of an elegant and illuminative 
synthesis. 24 

In 1951, after only a few months working on the problem, Watson and Crick presented 
their first model in a seminar at the Cavendish Laboratory in Cambridge. Maurice 
Wilkins, Rosalind Franklin, and two other scientists from King’s College, London, 
attended, as did Professor Lawrence Bragg, their supervisor at the Cavendish. The 
meeting went poorly. Watson and Crick represented DNA as a triple-stranded helix. 
Franklin objected. Though the sugar-phosphate backbone of the molecule might form a 
helix, she insisted, there was not yet “a shred of evidence” for this idea from X-ray 
studies. Such ideas were merely in the air as the result of recent discoveries about the 
presence of helical structures in proteins. 

Other aspects of their model were more obviously mistaken. Watson had miscalculated 
the density of water in the DNA molecule. (The amount of water absorbed by DNA 
detennines its dimensions and whether it will adopt one of two structures, the “A-fonn” 
or “B-form.”) When the correct water density was used to calculate the spatial 
dimensions of DNA, the justification for their model dissolved. Watson and Crick also 
had placed the sugar phosphate backbone in the center of the molecule with the bases 
sticking outward, thus producing a structure that looked something like a gnarled tree 
with stubby branches. Franklin correctly pointed out that DNA could take up water as 
easily as it did only if the phosphate molecules were on the outside, not the inside, of the 




structure. The phosphate groups had to be on the outside of the model where they could 
attract and hold water easily. 2 

Embarrassed for them, if not for himself, Professor Bragg asked Watson and Crick to 
stop work on DNA. Crick was to finish his Ph.D. dissertation; Watson was assigned to 
study viruses. 

Collecting the Clues 

Undaunted by their failure, the two gradually wormed their way back into the action. 
Together they mused and snooped, tinkered with toy models, and picked the brains of the 
various specialists in England and abroad. By 1952 a growing number of scientists had 
set aside the distraction of protein molecules and were focusing squarely on 
deoxyribonucleic acid, including two-time Nobel laureate Linus Pauling, the Caltech 
chemist who had earlier determined the fonn of an important structure within proteins, 
the alpha-helix. Watson and Crick sensed that time was short. At any moment somebody 
might beat them to the discovery of the structure of the mystery molecule. But that fear 
raised an exhilarating possibility: perhaps collectively the scientific community knew 
enough already. Perhaps it wasn’t fresh evidence that was needed, but rather a flash of 
insight about how all the pieces of evidence fit together. 

While others approached the problem methodically, steadily gathering data in their labs, 
Watson and Crick behaved more like gumshoe detectives, heading from one place to 
another looking for clues to help them weigh the merits of competing hypotheses. Then 
on January 28, 1953, Watson procured a draft copy of a scientific manuscript written by 
Linus Pauling from Pauling’s son, Peter, who was working in Cambridge as a Ph.D. 
student under John Kendrew. The senior Pauling was proposing a triple-helix structure 
for DNA that was very similar to the model Watson and Crick had proposed the year 
before. He sent the draft to Peter, who showed it to Watson and Crick. 

Like Watson and Crick’s original model, Pauling’s envisioned a triple-sugar-phosphate 
backbone running up the middle (or inside) of the molecule with the nucleotide bases 
attached on the outside and pointing outward. Watson exhaled in relief. He knew all too 
well that Pauling’s model couldn’t be right. But what was? Two days later, with 
Pauling’s manuscript in hand, he traveled to the lab at the University of London to see 
what he could turn up. The King’s College lab was the place to go to see the best 
current X-ray images of DNA. 

Although the lab employed a cutting-edge technology, conceptually the essence of its 
technique was a simple old trick from physics: throw something at an object and see what 
bounces back or passes through. Then collect the resulting signal and see what you can 
tell about the object under study by analyzing it. Bats navigate the airways using this 
technique. Their echo-location system sends sound waves at objects and then times the 
returning signals so the bats can “see,” or locate the objects around them. As noted in 
Chapter 1, geophysicists use a similar technique. They send sound waves deep 




underground and then collect the resulting echoes in order to image the subsurface of the 
earth. 

The key technology in the search for the structure of DNA employed a variation of this 
strategy as well. Instead of projecting sound waves at DNA, the scientists at King’s 
College directed X rays at fibers of DNA. And instead of analyzing what bounced back, 
they collected the rays that passed through the molecules. By seeing how the direction of 
the X rays changed — how they were diffracted by their target — these scientists were 
eventually able to learn about the structure of DNA. 

Rosalind Franklin was the acknowledged expert on this technique for studying DNA. 
That’s why Watson was there. Fra nk lin had earlier discovered that DNA has two distinct 
forms with different dimensions depending upon whether water is present. This in itself 
was a huge breakthrough, since previously the two forms were jumbled together, 
muddying the results anytime someone tried to use X-ray diffraction to discern the form 
of DNA. Armed with this fresh insight, Franklin set about developing a method for 
separating the two forms. The technique she arrived at was highly demanding, but also 
highly effective. Now she could and did obtain revealing diffraction patterns of the 
“B-form DNA.” When Watson showed up at King’s lab, he had a tense conversation with 
Franklin. He lectured her about helical theory and why DNA must be a helix. Franklin 
insisted that there was no proof of that yet. Fra nk lin rose in anger from behind her lab 
bench, visibly annoyed at Watson’s presumption and condescension. Watson made a 
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hasty retreat from Franklin’s lab, later saying he feared that she might strike him. 




Figure 3.5. Portrait of Rosalind Franklin (1920-58). Courtesy of Science Source/Photo 
Researchers. 




But before leaving the lab, Watson stopped by to see Maurice Wilkins. After a little 
prodding, Wilkins gave Watson a look at Franklin’s best X ray of DNA in the B-fonn. 
The picture showed, quite distinctly, a pattern known as a Maltese cross (see Fig. 3.6). 
Watson was elated. What Crick had taught him told him he was looking at evidence of a 
helix. On the train ride home, Watson sketched the image from memory. Upon seeing 
Watson’s sketch and after questioning Watson, Crick agreed it must be a helix. But what 
kind, and how did the chemical constituents of DNA fit into this structure? 



Clues gleaned from another quarter would ultimately help Watson and Crick answer 
these questions. The year before, they had shared a meal with the gruff, brilliant chemist 
Erwin Chargaff while he was visiting in Cambridge. During the meal they asked a 
number of questions that exposed their own ignorance of some of the relevant chemistry. 
In particular, it became apparent during the meal that they did not know about Chargaff s 
famous correspondences, or “rules,” establishing that the amount of guanine in DNA 
equaled the amount of cytosine (G = C) and the amount of adenine equaled the amount of 
thymine (A = T). The eminent biochemist, who at the time knew virtually everything 
there was to know about the chemistry of DNA, was aghast that this breezy, ambitious 
pair didn’t know these basics. Chargaff even got Crick to admit that he did not kn ow the 
differences in the chemical structure of the four nucleotides bases in the DNA molecule, 
which by then was common knowledge to everyone else in the race. 




Figure 3.6. Maltese cross X-ray image of DNA crystal. Helix on the right represents the 
structure that scientists think produces the Maltese cross X-ray image. 



Chargaff would later muse about the irony of these seemingly uninitiated scientists 
making the discovery of the ages: “I seem to have missed the shiver of recognition of a 
historical moment; a change in the rhythm of the heartbeats of biology. As far I could 



make out, they wanted, unencumbered by any knowledge of the chemistry involved, to fit 
DNA into a helix. I do not remember whether I was actually shown their scale model of a 
polynucleotide chain, but I do not believe so, since they still were unfamiliar with the 
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chemical structures of the nucleotides.” Despite his contempt for this ambitious pair of 
know-nothings, or perhaps because of it, Chargaff passed on to them the correspondences 
he had discovered. 4 

So by the time Watson returned from his visit to Franklin and Wilkins at the end of 
January 1953, he and Crick knew several key facts about nucleic acid. In addition to 
Chargaff s rules, they knew from Franklin’s X-ray images that DNA almost certainly did 
form a helix with a backbone made of sugars li nk ed to phosphates. From X-ray studies of 
the molecule, they also knew the key dimensions of the B-form of the DNA 
molecule — 20 angstroms across and 34 angstroms long for one full turn of the helix (an 
angstrom is the length of one hydrogen atom, about one ten-billionth of a meter). And 
they knew that Franklin was convinced that the sugar-phosphate backbone would have to 
have the phosphates on the outside. 

They also knew they had competition. Watson’s trip to London no doubt had been partly 
motivated by his serendipitous acquisition of Linus Pauling’s manuscript. Though 
Watson was enormously relieved that Pauling had blundered, he suspected that it would 
not take long for Pauling to discover his mistakes. Franklin also knew, immediately upon 
reading Pauling’s proposal, that it was wrong. Pauling had proposed a structure in which 
the sugar-phosphate strands of the helix ran through the center of the molecule. Thus, his 
model required the bases to stick out horizontally from the rising helices. This meant that 
the molecule would not define a smooth or definite edge, but instead a jagged series of 
knobby protrusions. Yet Franklin knew that the X-ray data showed that the molecule had 
a definite 20 angstrom diameter. Such a precise measurement could never have been 
established if the molecule had the kind of uneven edge Pauling envisioned. 

Pauling’s model also failed to account for Chargaffs correspondences. And it had 
density problems. With three helices running right through the middle of the molecule, 
the density of the atoms in the center was too high to be consistent with available data. 
X-ray and chemical studies had revealed the number of bases (about 10) present in the 
molecule per turn of a single helix. A triple helix should, therefore, have a 
proportionately higher density of bases per unit length than a double helix. When Watson 
discovered that density measurements (bases per unit length of a fiber of DNA) agreed 
more closely with the values calculated for a double helix than a triple helix, he and Crick 
not only rejected Pauling’s triple-helix model, but all triple helices. DNA, they were 
convinced, was much more likely to be some sort of double helix. They were much 
closer to a solution, but so too were the others in the race. Watson and Crick sensed that 
time was short, and they had spacing problems of their own. 



Models and Molecules 




To solve the puzzle, Watson began in haste to build a series of models. He first tried to 
make a double helix with the helices again running up the center of the molecule, 
ignoring Franklin’s earlier insistence about the need for keeping the phosphates exposed 
on the outside of the molecule. Crick recalls that they persisted in this approach because 
they thought the demands of biology required it. If DNA was, indeed, the molecule of 
hereditary infonnation and the arrangement of bases conveyed genetic instructions, then 
the structure of DNA must allow some way of copying these instructions. Putting the 
bases on the outside made it easier to envision how the bases were copied during cell 
division. Additionally, placing them on the outside of the molecule eliminated the need to 
figure out how the bases fit together within the strands. “As long as the bases were on the 
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outside, we didn’t have to worry about how to pack them in,” said Crick. 

Unfortunately, an outside placement made it more difficult to explain other facts. First, a 
jagged bases-on-the-outside model would not produce consistent 20-angstrom-diameter 
X-ray measurements — a point Franklin had made earlier. Also, positioning the bases on 
the outside of the sugar-phosphate backbone made it difficult to explain how DNA could 
form crystals at all. Crystals are solidified chemical structures that form a highly regular 
structure called a lattice as the result of a repeating chemical structure. DNA crystals 
form when long strands of (A-fonn) DNA molecules align themselves alongside each 
other. To get a visual image of what a crystal of DNA would look like, imagine a series 
of parallel lanes at a drag racing track with each lane of equal width. Now instead of 
those lanes being perfectly straight, imagine them curving back and forth in parallel to 
each other all the while maintaining that even spacing. 

The jagged edged molecule Watson envisioned in his bases-on-the-outside model was not 
conducive to the formation of regularly spaced, parallel strands of DNA. The jagged 
edges might stick together, but only in an irregular tongue-in-groove sort of way. Yet the 
ability to study DNA structure with X rays is possible only because of the regular spacing 
DNA molecules exhibit when aligned in parallel. 

Placing the bases on the outside and the backbone on the inside also made it more 
difficult to produce a structure that matched known measurements of the length between 
the bases. X-ray measurements indicated that there are about 3.4 angstroms of space 
between each base on the backbone. But running the backbone up the middle of the 
molecule, as opposed to along the outside edge, generates a helix with a steeper pitch 
(angle of ascent). Watson soon found that the pitch of his internal backbone model 
created too much space between the sugar groups (and thus the bases) to match the 
3.4-angstrom measurement derived from X-ray studies. As Watson worked with the 
model, he also realized that the chemical properties of the backbone limited how much it 
could be compressed or stretched to accommodate definite spacing requirements. Further, 
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as historian of science Robert Olby recounts, the structure as a whole looked “awful.” 

A change of direction was clearly needed. Watson began to try models with the 
backbones on the outside. Initially, things went no better. But then two breakthroughs 



came. 




“Aha” Times Two 



The first breakthrough came as a flash of insight to Francis Crick. As Watson began 
trying to fit the bases into the interior of the helices, leaving the backbone on the outside 
of the model molecule, he still encountered difficulties. Watson didn’t know, however, 
that how he oriented the two sugar-phosphate backbones in relation to each other would 
matter. He knew that there is a structural asymmetry in the construction of the backbone 
of the DNA molecule. The phosphate groups attach to a different carbon molecule on one 
side of the ribose sugar than they do on the other. On one side of the ring structure of the 
sugar, the phosphate attaches to what is called its 5’ carbon (five prime carbon); on the 
other, the phosphate attaches to what is called its 3’ carbon (see Fig. 3.7). Chemists know 
these differences well, and so did Watson. 

But as Watson was constructing his models, he had assumed that the two backbones 
should run in the same direction, parallel to each other with each backbone starting from 
the 5' end and running to the 3’ end of the chain. So try as he might, he could not get the 
bases to lock into place — to connect with each other — in a stable structure across the 
molecule. Then a crucial insight came. 
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Figure 3.7. The sugar phosphate backbone of the DNA showing how phosphates are 
bonded to the 5' carbon of one sugar and the 3' carbon of another. The genetic text 
(sequence of bases) is transcribed in the 5' to 3’ direction of the DNA molecule. 



Crick, who was studying the structure of proteins in his Ph.D. dissertation, was familiar 
with helical structures in proteins. He kn ew that sometimes sections of the alpha helices 
of a protein will run “antiparallel” to each other. That is, one section of a helix in a 
protein will run up, then turn back the other direction and run down right alongside the 
first section of the helix, making an upside-down “U-shape.” One day while reading a 
highly technical description of the X-ray diffraction pattern of the B-form of DNA in a 
Medical Research Council report summarizing Rosalind Franklin’s data, 40 Crick realized 
that the pattern she was describing — called a “monoclinic C2 symmetry” pattern — was 
indicative of an antiparallel double helix structure. Instead of the two strands running 
from 5’ to 3’ in the same direction, Crick realized that Franklin’s data indicted that one 
helix was running up in the 5’ to 3' direction and the other strand was running down in the 



5' to 3’ direction the other way. He also realized by studying the dimensions indicated by 
the X-ray data that these antiparallel backbones were not separate molecules or separate 
stretches of the same molecule bent back on itself as occurred in proteins. Instead, these 
were two separate strands entwined around each other. They were simply too close 
together to be anything else (see Fig. 3.8). 
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Figure 3.8. Antiparallel strands of DNA entwined around each other. Represented in two 
dimensions (left) and three dimensions (right). 



Crick’s insight was possible only because of his specialized expertise in crystallography. 
At the time, it was nearly beyond Watson to understand Crick’s argument, and he 
apparently took quite a long time to get it. But once he did and once he started using 
antiparallel sugar-phosphate backbones, his model building would proceed in an entirely 
more satisfying and fruitful direction — though he would first need to have a flash of 
insight of his own. 



Eureka! 

Watson resumed his model building in late February 1953 using cardboard cutouts of his 
own construction as he waited for the metal pieces to come from the machine shop. 41 The 



challenge he faced was to find a way to fit these four irregularly shaped bases into the 
available space between the two twisting sugar-phosphate backbones and to do so in a 
way that was consistent with the chemical properties of each of the bases and the sugar 
molecules to which the bases must attach. He had studied every scientific paper he could 
about the dimensions and chemical proclivities of the bases. Watson learned that the 
bases often form a weak kind of chemical bond with each other called a hydrogen bond. 
Perhaps he could bridge the gap between the two backbones with two bases connected by 
hydrogen bonds. This seemed plausible. But there was a complicating factor. The bases 
weren’t the same size. Two of the bases, adenine and guanine, called the purines, 
exhibited a large double-ring structure, while the two others, cytosine and thymine, called 
pyrimidines, exhibited a smaller single-ring structure (see Fig. 3.9). 

Watson tried to pair the two similar bases with each other. He found that he could get 
them to join together by forming double hydrogen bonds. Perhaps these like-with-like 
base pairs could bridge the gap between the two backbones. Indeed, he could envision 
them forming rungs on the twisting ladder. 
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Figure 3.9. The basic chemical structures of purines and pyrimidines. Notice that 
pyrimidines have a single-ring structure and purines have a doublering structure. The 
bases adenine and guanine are purines. Thymine and cytosine are pyrimidines. Since 
complementary base pairing always involves one pyrimidine and one purine, the rungs on 
the DNA ladder formed by these pairs are closely equivalent in size and shape, whether 
they are adenine-thymine pairs or guanine-cytosine pairs. 



But there were problems. Watson recognized that if he used the like-with-like pairs, he 
would have a hard time producing a helix with a consistent diameter. The pairs of large 
purine bases would cause the helix to bulge in some places, while the pairs of the smaller 
pyrimidines would cause it to narrow in others. And there was another difficulty. Lab 
associate Jerry Donohue, who knew a lot about the chemical structure and properties of 
the nitrogenous bases, insisted that Watson was mistaken. He was. Most of the 
nitrogenous bases come in either one of two variant forms, a common “keto” form and a 
rare “enol” form. These two forms have slightly different structural formulas, reflecting 
slight differences in the configurations of their constituent atoms. For three of the four 
bases, Watson had unknowingly used the rare alternative form of the bases, instead of the 



common “keto” form that is actually present in DNA. Deferring to Donohue’s expertise, 
Watson tried again using the correct (keto) forms of the bases. But this made hydrogen 
bonding much more difficult for like-with-like pairings. 43 

That’s when Watson’s insight came. He remembers that it occurred the morning of 
February 28, 1953. Still waiting for the tin models to arrive from the Cavendish shop, he 
cleared the papers off the desktop and, constructing a makeshift set of cardboard models, 
he began trying to fit the pieces together this way and that. Initially, he reverted to his 
“like-with-like prejudices,” but very quickly he realized this was a dead end. He began to 
try other pairing schemes: “Suddenly I became aware that an adenine-thymine pair held 
together by two hydrogen bonds was identical in shape to a guanine-cytosine pair held 
together by at least two hydrogen bonds. All the hydrogen bonds seemed to form 
naturally; no fudging was required to make the two types of base pairs identical in 
shape.” 44 

The AT pair formed a rung across the helix that was the exact length and shape of the GC 
pair. He placed these pairs inside the two backbones. Now the twisting ladder maintained 
a consistent diameter. No bulging. No pinching. Everything fit. 



The Double Helix 

Watson now saw the double helix rising before him and marveled at how “two irregular 
sequences of bases could be regularly packed in the center of a helix.” 45 This new 
structure also explained Chargaff s rules. If A bonded only to T and if C bonded only to 
G, then of course the amount of A would always equal T and the amount of C would 
always equal G. It was also so easy to see the two strands parting and new As, Ts, Gs, 
and Cs tumbling into place, seeking out hydrogen bonds with their natural opposites on 
the single strands. Soon each single strand would be its own double strand and, presto, 
the winding staircase structure would replicate itself. Crick arrived at the lab and quickly 
discovered that Watson’s revised structure explained several other significant facts of 
DNA chemistry as well. 46 

The new model was beautiful and fit the data exceptionally well. Crick later observed 
that he and Watson “were looking for gold” and by “blundering about” had found the 
gold they sought. 47 At lunch, Crick was telling everyone at the Eagle pub, around the 
comer from the Cavendish on Free School Lane in Cambridge, that they had found the 
secret of life. On April 25, 1953, a seemingly modest paper appeared in the journal 
Nature. The article was only nine hundred words long, was signed by a pair of unknowns, 
J. D. Watson and F. H. C. Crick, and featured the anodyne title “Molecular Structure of 
Nucleic Acids: A Structure for Deoxyribose Nucleic Acid.” It revolutionized biology. 





Figure 3.10. Watson and Crick presenting their model of the DNA double helix. Courtesy 
of A. Barrington Brown/Photo Researchers, Inc. 



The Watson-Crick model made it clear that DNA had an impressive chemical and 
structural complexity. It was a very long molecule composed on the outside of a regular 
arrangement of sugar and phosphate molecules. But on the inside it could contain many 
potentially different arrangements of the four bases. Thus, it had an impressive potential 
for variability and complexity of sequence as required by any potential carrier of 
hereditary information. As Watson and Crick later explained: “The phosphate-sugar 
backbone of our model is completely regular, but any sequence of the pairs of bases can 
fit into the structure. It follows that in a long molecule, many different permutations are 
possible, and it therefore seems likely that the precise sequence of the bases is the code 
which carries the genetic information.” 49 Thus, their paper not only described the 




structure of DNA; it also anticipated what later discoveries would confirm: DNA was a 
repository of information. 

Watson and Crick’s discovery would forever change our understanding of the nature of 
life. At the close of the nineteenth century, most biologists thought life consisted solely of 
matter and energy. But after Watson and Crick, biologists came to recognize the 
importance of a third fundamental entity in living things: infonnation. And this discovery 
would redefine, from that point forward, what theories about the origin of life would need 
to explain. 




4 



Signature in the Cell 



In Darwin’s time few, if any, biologists talked about biological or genetic information, 
but today they routinely refer to DNA, RNA, and proteins as carriers or repositories of 
information. Biologists tell us that DNA stores and transmits “genetic information,” that 
it expresses a “genetic message,” that it stores “assembly instructions,” a “genetic 
blueprint,” or “digital code.” 1 Biology has entered its own information age, and scientists 
seeking to explain the origin of life have taken note. Life does not consist of just matter 
and energy, but also information. Since matter and energy were around long before life, 
this third aspect of living systems has now taken center stage. At some point in the 
history of the universe, biological information came into existence. But how? Theories 
that claim to explain the origin of the first life must answer this question. 

But what exactly is information? What is biological information? Beginning in the late 
1940s, mathematicians and computer scientists began to define, study, measure, and 
quantify information. But they made distinctions between several distinct types or 
conceptions of information. What kind of information does DNA have? What kind of 
information must origin-of-life researchers “explain the origin of’? 

As we will see, it is important to answer these questions because DNA contains a 
particular kind of information, one that only deepens the mystery surrounding its origin. 



Defining Information: Two Distinctions 

Most of us use the term “information” to describe some piece of knowledge. When we 
say so-and-so passed on some interesting information, we mean that so-and-so told us 
something that we didn’t know before, but that we now know, thanks to what we were 
told. In other words, information equals knowledge. The first definition of information in 
Webster’s dictionary reflects this idea: information is “the communication or reception of 
knowledge or intelligence.” Because many of my students had this idea of information 
firmly in mind, they were often confused at first when I talked about information stored 
in a molecule. There is a sense in which it could be said that DNA stores the “ kn ow-how” 
for building molecules in the cell. Yet since neither DNA nor the cellular machinery that 
receives its instruction set is a conscious agent, equating biological information with 
knowledge in this way didn’t seem to quite fit. 

But our English dictionaries point to another common meaning of the term that does 
apply to DNA. Webster’s, for instance, has a second definition that defines information 
as “the attribute inherent in and communicated by alternative sequences or arrangements 



of something that produce specific effects.” Information, according to this definition, 
equals an arrangement or string of characters, specifically one that accomplishes a 
particular outcome or performs a communication function. Thus, in common usage, we 
refer not only to a sequence of English letters in a sentence, but also to a block of binary 
code in a software program as infonnation. Information, in this sense, does not require a 
conscious recipient of a message; it merely refers to a sequence of characters that 
produces some specific effect. This definition suggests a definite sense in which DNA 
contains information. DNA contains “alternative sequences” of nucleotide bases and can 
produce a specific effect. Of course, neither DNA nor the cellular machinery that uses its 
information is conscious. But neither is a paragraph in a book or a section of software (or 
the hardware in the computer that “reads” it). Yet clearly software contains a kind of 
information. 

This seemed reasonably clear to me in 1985 after I first began to think about the DNA 
enigma. But at the time something else puzzled me. I had been working in a technical 
field in which the processing of infonnation on computers was part of our stock and 
trade. I was familiar with the science of information storage, processing, and transmission 
called “infonnation theory.” Information theory was developed in the 1940s by a young 
MIT engineer and mathematician named Claude Shannon. Shannon was studying an 
obscure branch of algebra and, not surprisingly, few people were paying any attention. 
He would later gain notoriety as the inventor of the rocket-powered Frisbee and for 
juggling four balls while cruising down the halls of Bell Laboratories on a unicycle, but 
in 1937 he was putting the finishing touches on a master’s thesis that, to some, may have 
seemed hopelessly dull. Shannon had taken nineteenth-century mathematician George 
Boole’s system of putting logical expressions in mathematical form and applied its 
categories of “true” and “false” to switches found in electronic circuits. 





Figure 4.1. Portrait of the U.S. mathematician and information theorist Claude Shannon 
(1916-2001). Courtesy of the Estate of Francis Bello/Photo Researchers, Inc. 



“I’ve always pursued my interests without much regard to financial value or value to the 
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world,” Shannon confessed in 1983. “I’ve spent lots of time on totally useless things.” 

His master’s thesis, however, wasn’t one of them. Called “possibly the most important, 
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and also the most famous, master’s thesis of the century,” it eventually became the 
foundation for digital-circuit and digital-computer theory. Nor was he finished laying 
these foundations. He continued to develop his ideas and eventually published “The 
Mathematical Theory of Communications.” Scientific American later called it “the 
Magna Carta of the information age.” 4 

Shannon’s theory of information provided a set of mathematical rules for analyzing how 
symbols and characters are transmitted across communication channels. After the 
conference in 1985 awakened my interest in the origin of life, I began to read more about 
Shannon’s theory of information. I learned that his mathematical theory of information 
applied to DNA, but there was a catch. 



Shannon’s Information Theory 



Shannon’s theory of infonnation was based upon a fundamental intuition: information 
and uncertainty are inversely related. The more infonnative a statement is, the more 
uncertainty it reduces. 

Consider this illustration. I live in Seattle, where it rains a lot, especially in November. If 
someone were to tell me that it will rain in November, that would not be a very 
informative statement. I am virtually certain that it will rain then. Saying so does not 
reduce uncertainty. On the other hand, I know very little about what the weather will be 
like in Seattle on May 18. I’m uncertain about it. If a time-traveling weather forecaster 
with a flawless track record were to tell me that on May 18 next year Seattle will have an 
unseasonably cold day resulting in a light dusting of snow, that would be an informative 
statement. It would tell me something I couldn’t have predicted based upon what I know 
already — it would reduce my uncertainty about Seattle weather on that day. 

Claude Shannon wanted to develop a theory that could quantify the amount of 
information stored in or conveyed across a communication channel. He did this first by 
linking the concepts of information and uncertainty and then by linking these concepts to 
measures of probability. 

According to Shannon, the amount of information conveyed (and the amount of 
uncertainty reduced) in a series of symbols or characters is inversely proportional to the 
probability of a particular event, symbol, or character occurring. Imagine rolling a 
six-sided die. Now think about flipping a coin. The die comes up on the number 6. The 
coin lands on tails. Before rolling the die, there were six possible outcomes. Before 
flipping the coin, there were two possible outcomes. The cast of the die thus eliminated 
more uncertainty and, in Shannon’s theory, conveyed more information than the coin 
toss. Notice here that the more improbable event (the die coming up 6) conveys more 
information. By equating infonnation with the reduction of uncertainty, Shannon’s theory 
implied a mathematical relationship between information and probability. Specifically, it 
showed that the amount of information conveyed by an event is inversely proportional to 
the probability of its occurrence. The greater the number of possibilities, the greater the 
improbability of any one being actualized, and thus the more infonnation that is 
transmitted when a particular possibility occurs. 

Shannon’s theory also implied that information increases as a sequence of characters 
grows, just as we might expect. The probability of getting heads in a single flip of a fair 
coin is 1 in 2. The probability of getting four heads in a row is 1/2 x 1/2 x 1/2 x 1/2, that 
is, (1/2) 4 or 1/16. Thus, the probability of attaining a specific sequence of heads and tails 
decreases as the number of trials increases. And the quantity of information increases 
correspondingly. 5 This makes sense. A paragraph contains more infonnation than the 
individual sentences of which it is composed; a sentence contains more information than 
the individual words in the sentence. All other things being equal, short sequences have 
less information than long sequences. Shannon’s theory explained why in mathematical 
tenns: improbabilities multiply as the number of characters (and combination of 
possibilities) grows. 




For Shannon, the important thing was that his theory provided a way of measuring the 
amount of infonnation in a system of symbols or characters. And his equations for 
calculating the amount of information present in a communication system could be 
readily applied to any sequence of symbols or coding system that used elements that 
functioned like alphabetic characters. Within any given alphabet of x possible characters 
(where each character has an equal chance of occurring), the probability of any one of the 
characters occurring is 1 chance inx For instance, if a monkey could bang randomly on a 
simplified typewriter possessing only keys for the 26 English letters, and assuming he 
was a perfectly random little monkey, there would be 1 chance in 26 that he would hit 
any particular letter at any particular moment. 

The greater the number of characters in an alphabet (the greater the value of x), the 
greater the amount of information conveyed by the occurrence of a specific character in a 
sequence. In systems where the value of x is known, as in a code or language, 
mathematicians can generate precise measures of infonnation using Shannon’s equations. 
The greater the number of possible characters at each site and the longer the sequence of 
characters, the greater the Shannon information associated with the sequence. 



What Shannon’s Theory Can’t Say 

But, as I said, there was a catch. Though Shannon’s theory and his equations provided a 
powerful way to measure the amount of information stored in a system or transmitted 
across a communication channel, it had important limits. In particular, Shannon’s theory 
did not, and could not, distinguish merely improbable sequences of symbols from those 
that conveyed a message or “produced a specific effect” — as Webster’s second definition 
puts it. As one of Shannon’s collaborators, Warren Weaver, explained in 1949, “The 
word information in this theory is used in a special mathematical sense that must not be 
confused with its ordinary usage. In particular, information must not be confused with 
meaning.” 6 

Consider two sequences of characters: 



“Four score and seven years ago” 
“nenen ytawoi jll sn mekhdx nnx” 



Both of these sequences have an equal number of characters. Since both are composed of 
the same 26-letter English alphabet, the amount of uncertainty eliminated by each letter 
(or space) is identical. The probability of producing each of those two sequences at 
random is identical. Therefore, both sequences have an equal amount of information as 
measured by Shannon’s theory. But one of these sequences communicates something, 
while the other does not. Why? 

Clearly, the difference has something to do with the way the letters are arranged. In the 
first case, the letters are arranged in a precise way to take advantage of a preexistent 




convention or code — that of English vocabulary — in order to communicate something. 
When Abraham Lincoln wrote the first words of the Gettysburg address, he arranged the 
letters — f-o-u-r — s-c-o-r-e — a-n-d — s-e-v-e-n — y-e-a-r-s — a-g-o — in a specific sequence. 
When he did so he invoked concepts — the concept of “four,” the concept of “years,” and 
so on — that had long been associated with specified arrangements of sounds and 
characters among English speakers and writers. The specific arrangement he chose 
allowed those characters to perform a communication function. In the second sequence, 
however, the letters are not arranged in accord with any established convention or code 
and, for that reason, are meaningless. Since both sequences are composed of the same 
number of equally improbable characters, both sequences have a quantifiable amount of 
information as calculated by Shannon’s theory. Nevertheless, the first of the two 
sequences has something — a specificity of arrangement — that enables it “to produce a 
specific effect” or to perform a function, whereas the second does not. 

And, that’s the catch. Shannon’s theory cannot distinguish functional or message-bearing 
sequences from random or useless ones. It can only measure the improbability of the 
sequence as a whole. It can quantify the amount of functional or meaningful information 
that might be present in a given sequence of symbols or characters, but it cannot 
detennine whether the sequence in question “produces a specific effect.” For this reason, 
information scientists often say that Shannon’s theory measures the 
“information-carrying capacity,” as opposed to the functionally specified infonnation or 
“information content,” of a sequence of characters or symbols. This generates an 
interesting paradox. Long meaningless sequences of letters can have more information 
than shorter meaningful sequences, as measured by Shannon’s information theory. 

All this suggested to me that there are important distinctions to be made when talking 
about information in DNA. In the first place, it’s important to distinguish information 
defined as “a piece of knowledge known by a person” from information defined as “a 
sequence of characters or arrangements of something that produce a specific effect.” 
Whereas the first of these two definitions of information doesn’t apply to DNA, the 
second does. But it is also necessary to distinguish Shannon information from 
information that performs a function or conveys a meaning. We must distinguish 
sequences of characters that are (a) merely improbable from sequences that are (b) 
improbable and also specifically arranged so as to perform a function. That is, we must 
distinguish information-carrying capacity from functional infonnation. 

So what kind of infonnation does DNA possess, Shannon information or some other? To 
answer this question we will need to look at what molecular biologists have discovered 
since 1953 about the role of DNA within the miniature world of the cell. 



Genetic Information: What Does It Do? 

In the wake of Watson and Crick’s seminal 1953 paper, scientists soon realized that DNA 
could store an immense amount of information. The chemistry of the molecule allowed 




any one of the bases to attach to any one of the sugar molecules in the backbone, 
allowing the kind of variable sequencing that any carrier of genetic infonnation must 
have. Further, the weak hydrogen bonds that held the two antiparallel strands together 
suggested a way the molecule might unzip — like two opposing strips of Velcro — to allow 
the exposed sequence of bases to be copied. It seemed that DNA was ideal for storing 
information-rich sequences of chemical characters. But how DNA expressed this 
information and how it might be using that information remained uncertain. Answers 
would soon come, however, thanks to developments in the field of protein chemistry. 



The Puzzle of Proteins 

Scientists today kn ow that protein molecules perform most of the critical functions in the 
cell. Proteins build cellular machines and structures, they carry and deliver cellular 
materials, and they catalyze chemical reactions that the cell needs to stay alive. Proteins 
also process genetic infonnation. To accomplish this critical work, a typical cell uses 
thousands of different kinds of proteins. And each protein has a distinctive shape related 
to its function, just as the different tools in a carpenter’s toolbox have different shapes 
related to their functions. 

By the 1890s, biochemists had begun to recognize the centrality of proteins to the 
maintenance of life. They knew proteins were heavy (“high molecular weight”) 
molecules that were involved in many of the chemical reactions going on inside cells. 
During the first half of the twentieth century, chemists were also able to detennine that 
proteins were made of smaller molecules called amino acids. During this time, many 
scientists thought that proteins were so important that they, rather than DNA molecules, 
were the repositories of genetic information. Even so, until the 1950s, scientists 
repeatedly underestimated the complexity of proteins. 

One reason for this mistaken view was a discovery made by a distinguished British 
scientist. William Astbury was an outstanding scientist with impeccable credentials. 
Formerly chosen “head boy” at a distinguished English boarding school, Astbury had 
studied physics at Cambridge during and after World War I. He then worked with 
William Bragg, the pioneering X-ray crystallographer whose son Lawrence later 
supervised Watson and Crick. Astbury had an infectious enthusiasm for laboratory work 
and a gift for persuasion. But he also had a reputation for sometimes letting his 
enthusiasm get the better of him and for jumping to premature conclusions. 

Astbury was convinced that proteins held the key to understanding life. And partly for 
that reason, he thought they should exhibit a simple, regular structure that could be 
described by a mathematical equation or some general law. Perhaps his occupation biased 
him toward this view. As a crystallographer, he was used to studying highly regular and 
orderly structures. For example, salt crystals, the first structures detennined using X-ray 
techniques, have a highly repetitive or regular structure of sodium and chlorine atoms 
arranged in a three-dimensional grid, a pattern in which one type of atom always has six 
of the other types surrounding it (see Fig. 4.2). Astbury was convinced that proteins — the 




secret of life — should exhibit a similar regularity. During the 1930s he made a discovery 
that seemed to confirm his expectation. 

Astbury used X rays to determine the molecular structure of a fibrous protein called 
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keratin, the key structural protein in hair and skin. Astbury discovered that keratin 
exhibits a simple, repetitive molecular structure, with the same pattern of amino acids 
repeating over and over again — just like the repeating chemical elements in a crystal (or 
the repeating pattern of bases that P. A. Levene had mistakenly proposed as a structure 
for DNA). Astbury concluded that all proteins, including the mysterious globular proteins 
so important to life, would exhibit the same basic pattern he discovered in keratin. Many 
of Astbury’s contemporaries shared the same view. In 1937, for example, two leading 
American biochemists, Max Bergmann and Carl Niemann, of the Rockefeller Institute, 
argued that the amino acids in all proteins occurred in regular, mathematically 
expressible proportions. 8 




Figure 4.2. Three-dimensional structure of a salt crystal. Sodium atoms are represented 
by the light-colored balls; the chlorine atoms by the dark-colored balls. 



During the early 1950s, about the same time that Watson and Crick were on the hunt to 
solve the structure of DNA, another Cambridge scientist made a discovery that would 
challenge the dominant view of proteins. While working just a couple miles from Watson 
and Crick at the now famous Laboratory for Molecular Biology (or LMB), biochemist 
Fred Sanger determined the structure of the protein molecule insulin. Sanger’s discovery 
would later earn him the first of two Nobel prizes in chemistry. Sanger showed that 
insulin consisted of irregular sequences of various amino acids, rather like a string of 
differently colored beads arranged with no discernible or repeating pattern. 9 Subsequent 
work on other proteins would show the same thing: the sequencing of amino acids is 
usually highly irregular and defies description by any general rule. 10 

But old prejudices die hard. Many biologists at the time still expected proteins, 
considered by many the fundamental unit of life, to exhibit regularity, if not in the 
arrangement of their amino acids, then at least in their overall three-dimensional shapes 
or structures. Most thought the three-dimensional structures of proteins would end up 
exhibiting some sort of geometric regularity. Some imagined that insulin and hemoglobin 
proteins, for example, would look like “bundles of parallel rods.” 11 As Johns Hopkins 
biophysicist George Rose recounts, “Protein structure was a scientific terra incognita. 



With scant evidence to go on, biologists pictured proteins vaguely as featureless 
ellipsoids: spheres, cigars, Kaiser rolls.” 1 

Then came the publication of John Kendrew’s paper. Kendrew was the son of academics, 
an Oxford scientist and an art historian. He studied chemistry at Cambridge and 
graduated in 1939 just as World War II was beginning. After doing research on radar 
technology during the war, he took up the study of molecular biology at the Medical 
Research Council Laboratory in Cambridge in 1946. There Kendrew began to work 
closely with Max Perutz, the Austrian crystallographer who, along with Lawrence Bragg, 
had officially supervised Watson and Crick. In 1958, Kendrew made his own 
contribution to the molecular biological revolution when he published a paper on the 
three-dimensional structure of the protein myoglobin. 

“The report in Nature,'" historian Horace Judson recalls, “was marked by a comical tone 
of surprise, of eyebrows raised at eccentricity.” Far from the simple structure that 
biologists had imagined, Kendrew’s work revealed an extraordinarily complex and 
irregular three-dimensional shape, a twisting, turning, tangled chain of amino acids. 
Whereas protein scientists had anticipated that proteins would manifest the kind of 
regular order present in crystals (see Fig. 4.2), they found instead the complex 
three-dimensional structure shown in Figure 4.3. As Kendrew put it, with characteristic 
British understatement, “The big surprise was that it was so irregular.” 14 In the Nature 
paper, he wrote, “Perhaps the most remarkable features of the molecule are its 
complexity and its lack of symmetry. The arrangement seems to be almost totally lacking 
in the kind of regularities which one instinctively anticipates, and it is more complicated 
than has been predicted by any theory of protein structure.” 15 




Figure 4.3. The complex three-dimensional structure of the protein myoglobin. 






But biochemists soon recognized that proteins exhibited another remarkable property. In 
addition to their complex shapes and irregular arrangements of amino acids, proteins also 
exhibit specificity. By specificity, biologists mean that a molecule has some features that 
have to be what they are, within fine tolerances, for the molecule to perform an important 
function in the cell. 

Proteins are specified in two ways. First, proteins display a specificity of shape. The 
strangely irregular shapes of proteins that Kendrew and others discovered turned out to 
be essential to the function of the proteins. In particular, the three-dimensional shape of a 
protein gives it a hand-in-glove fit with other equally specified and complex molecules or 
with simpler substrates, enabling it to catalyze specific chemical reactions or to build 
specific structures within the cell. Because of its three-dimensional specificity, one 
protein cannot usually substitute for another. A topoisomerase can no more perform the 
job of a polymerase than a hatchet can perfonn the function of soldering iron or a 
hammer the job of a wrench. 

Figure 4.4 illustrates how three-dimensional specificity of fit determines the function of a 
protein, in this case, an enzyme. Enzymes are proteins that catalyze specific chemical 
reactions. The figure shows an enzyme called a beta-galactosidase and a two-part sugar 
molecule (a disaccharide) called lactose. Because the enzyme’s shape and dimensions 
exactly conform to the shape and dimensions of the disaccharide molecule, the sugar can 
nestle into the pockets of the enzyme. Once it does, a chemically active part of the 
enzyme, called an active site, causes a chemical reaction to occur. The reaction breaks the 
chemical bonds holding the two parts of the sugar together and liberates two individual 
molecules of glucose, each of which the cell can use easily. 

Consider another example of how the specific shape of proteins allows them to perform 
specific functions. The eukaryotic cell has an uncanny way of storing the infonnation in 
DNA in a highly compact way. (. Eukaryotes are cells that contain a nucleus and other 
membrane-bound organelles; prokaryotic cells lack these features.) Strands of DNA are 
wrapped around spool-like structures called nucleosomes. These nucleosomes are made 
of proteins called histones. And, again, it is the specific shape of the histone proteins that 
enables them to do their job. Histones 3 and 4, for example, fold into well-defined 
three-dimensional shapes with a precise distribution of positive electrical charges around 
their exteriors. This precise shape and charge distribution enables DNA strands to coil 
efficiently around the nucleosome spools and store an immense amount of information in 
a very small space. 16 Thanks in part to nucleosome spooling, the information storage 
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density of DNA is many times that of our most advanced silicon chips. 
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Figure 4.4. An enzyme (fi-D-galactosidase) breaking apart a two-part sugar molecule (a 
disaccharide). Notice the tight three-dimensional specificity of fit between the enzyme 
and the disaccharide at the active site where the reaction takes place. Notice also that the 
active site is small in relation to the size of the enzyme as a whole. 



To visualize how this works, imagine a large wooden spool with grooves on the surface. 
Next, picture a helical cord made of two strands. Then visualize wrapping the cord 
around the spool so that it lies exactly in the hollowed-out grooves of the spool. Finally, 
imagine the grooves hollowed in such a way that they exactly fit the shape of the coiled 
cord — thicker parts nestling into deeper grooves, thinner parts into shallower ones. The 
irregularities in the shape of the cord exactly match irregularities in the hollow grooves. 
In the case of nucleosomes, instead of grooves in the spool, there is an uncanny 
distribution of positively charged regions on the surface of the histone proteins that 
exactly matches the negatively charged regions of the double-stranded DNA that coils 
around it. 18 (See Fig. 4.5.) 

But proteins have a second type of specificity — one that helps to explain the first. 
Proteins do not just display a specificity of shape; they also display a specificity of 
arrangement. Whereas proteins are built from rather simple amino-acid “building 
blocks,” their various functions depend crucially on the specific arrangement of those 
building blocks. The specific sequence of amino acids in a chain and the resulting 
chemical interactions between amino acids largely determine the specific 
three-dimensional structure that the chain as a whole will adopt. Those structures or 
shapes determine what function, if any, the aminoacid chain can perfonn in the 
cell — whether as an enzyme, structural component, or a machine for processing 
information. 19 




Figure 4.5. A highly specific pattern of matching positive and negative charges allows 
information-rich DNA to wrap tightly around histone proteins in a process known as 
nucleosome spooling. Chromosomes are made of DNA spooled on histones. 



When I was teaching college students, I used to illustrate this relationship between 
amino-acid sequencing and protein structure and function with a children’s toy called 
Snap-Lock Beads. Snap-Lock Beads are made of plastic and come in many colors and 
shapes. Each bead has a hole on one end and a plug on the other that allows the beads to 
be snapped together to form a long chain. In my illustration, the individual beads with 
their distinctive shapes represent the twenty different protein-fonning amino acids; the 
chain of beads represents a potential protein; and the connection point between the beads 
represents a chemical bond. As I snapped the different-shaped beads together, I would 
explain to my students that proteins are fonned from amino acids linked together by a 
specific k in d of chemical bond called a peptide bond. As the amino acids “snap” together, 
the resulting chains are called polypeptides. If one of these polypeptide chains folds into a 
specific shape that enables it to perform tasks within the cell, it is called a protein. 

But that, it turns out, is a big “if.” Only if amino acids are arranged in very specific ways 
will the chain fold into useful shapes or conformations . Individual amino acids have 
distinguishing features (called side chains) that exert forces on each other. The resulting 
constellation of forces will cause the chain as a whole to twist and turn and, sometimes, 
fold into a stable structure. Most amino-acid arrangements, however, produce chains that 



do not fold into stable structures at all. Other arrangements generate stable structures, but 
do not perform a function. Relatively few sequences will produce functional structures. In 
each case, it is the arrangement of the amino acids that determines the difference. 
Specificity of sequence distinguishes proteins from ordinary, useless polypeptides. 

I would explain the importance of this specificity to my students by arranging and 
rearranging my Snap-Lock Beads. I would twist the chain to show how one sequence 
might cause the chain to fold into a specific three-dimensional shape. I would then 
rearrange the beads to show how a different sequence and constellation of forces might 
cause the chain to collapse into a limp or amorphous jumble. I emphasized that proteins 
have a type of “specificity” — called “sequence specificity” — that needed to be explained. 
A system or sequence of characters manifests “sequence specificity” if the function of the 
system as a whole depends upon the specific arrangement of the parts. Language has this 
property. Software has this property. And so too do proteins. 

The revolution in molecular biology produced some exciting and surprising discoveries 
about proteins and raised an intriguing question. If the structure of proteins depends upon 
the specific arrangement of its amino acids, what determines the arrangement of the 
amino acids? 



The Sequence Hypothesis 

Discoveries in the 1950s showed that both DNA and proteins are long, linear molecules 
composed of many irregularly arranged subunits of smaller molecules. Could these two 
otherwise very different kinds of molecules be connected in some important, but as yet 
undiscovered way? 

Following Sanger’s discovery, most molecular biologists assumed that proteins could not 

assemble themselves from free-floating amino acids in the cell. Too many processes in 

the cell depend upon particular proteins having just the right shape and sequence of 

amino acids to leave the assembly of these molecules to chance. Nor did it seem likely 

that some general chemical law or uniform force of attraction could explain the assembly 

of these irregular and oddly shaped structures. Instead, as the French molecular biologist 

Jacques Monod would later recall, molecular biologists began to look for some source of 

information or “specificity” within the cell that could direct the construction of these 
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highly specific and complex molecules. 

Watson and Crick’s seminal discovery suggested a source for that information or 
“specificity.” Perhaps it lay along the spine of DNA’s sugar-phosphate backbone. But 
what did the information in DNA have to do with the specific arrangements of amino 
acids in proteins? In 1958, Francis Crick proposed an answer in what came to be called 

DO 

the “sequence hypothesis.” 



The sequence hypothesis suggested that the nucleotide bases in DNA functioned just like 
alphabetic letters in an English text or binary digits in software or a machine code. 




According to Crick’s hypothesis, it is the precise arrangement or sequence of these bases 
that determines the arrangement of amino acids — which, in turn, determines protein 
folding and structure. In other words, the sequence specificity of amino acids in proteins 
derives from a prior specificity of arrangement in the nucleotide bases on the DNA 

O'} 

molecule. 

Crick’s sequence hypothesis was not the kind of hypothesis that could be confirmed by a 
single experiment. Nor was it based upon a single discovery. Rather, it capped a series of 
other discoveries and insights and would require many more experiments and discoveries 
to confirm. It was also a natural extension of what was already known, or at least 
suspected. 

Since the early 1940s, following the publication of a research paper by American 
geneticists George Beadle and Edward Tatum, many geneticists had begun to suspect a 
link between genes and proteins even before it was clear that genes were made of DNA. 
Beadle and Tatum had been studying fruit-fly eyes and bread molds. They used X rays to 
induce mutations in the chromosomal material of these organisms. They discovered that 
these mutations often had a discernable effect on their features. In the case of a bread 
mold called Neurospora, Beadle and Tatum discovered that the mutant forms of mold 
were missing an essential chemical compound — a compound that was produced with the 
help of a particular enzyme. Beadle and Tatum proposed that the mutated gene was 
normally responsible for synthesizing the enzyme in question, but that the mutations in 
the gene had destroyed its ability to do so. They further suggested that it is the job of 
genes generally to produce enzymes; in fact, they suggested that, as a rule, one gene 
produces one enzyme. 24 

Crick’s sequence hypothesis was built on this presumed linkage between genes and 
proteins and on other recent insights, including Avery’s identification of DNA as the 
likely carrier of genetic infonnation, Sanger’s discovery of the irregularity and linearity 
of amino-acid sequences, and Watson and Crick’s own understanding of the chemical 
structure of DNA. If a sequence of bases constitutes a gene, if genes direct protein 
synthesis, and if specific sequences of amino acids constitute proteins, then perhaps the 
specific sequencing of bases determines the specific sequencing of amino acids. By 1958, 
this seemed to Crick to be far more than just a reasonable supposition. The sequence 
hypothesis explained many of the central facts then in currency among the elite circle of 
British, American, and French molecular biologists trying to decipher exactly what DNA 
does. 

Even so, confirming Crick’s sequence hypothesis would eventually involve at least two 
additional kinds of experimental results. In a narrow sense, confirmation of the sequence 
hypothesis meant establishing that changes in genes made a difference in the amino-acid 
sequence of proteins. By the early 1960s, scientists had developed many techniques for 
studying the effects of changes in DNA sequences on proteins. These techniques enabled 
scientists to establish a definitive link between base sequences in DNA and the sequences 
of amino acids. Experiments using such techniques eventually revealed a set of 




correspondences between specific groups of bases and individual amino acids. These 
correspondences came to be called the “genetic code” (see Fig. 4.6). 
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Figure 4.6. The standard genetic code showing the specific amino acids that DNA base 
triplets specify after they are transcribed and translated during gene expression. 



In a broader sense, however, the confirmation of the sequence hypothesis would await the 
full explication of what is now called the gene-expression system — the system by which 
proteins are constructed from the infonnation in DNA. It was one thing to know that 
there is a correlation between specific bases sequences in DNA and specific amino acids 
in proteins. It was quite another to leam how that linkage is produced. Nevertheless, by 
the early 1960s, a series of experiments established how the information in DNA 
produces proteins and, in the process, provided a deeper and more comprehensive 
confirmation of Crick’s bold hypothesis. 26 

In the next chapter, I describe the intricate process of gene expression in more detail. 
Here I simply give a brief sketch of what scientists discovered about how the process 



works, since knowing that will enable me to answer the question I raised about the nature 
of biological information at the beginning of this chapter. 

Gene expression begins as long chains of nucleotide bases are copied during a process 
known as “transcription.” During this process, the genetic assembly instructions stored on 
a strand of DNA are reproduced on another molecule called “messenger RNA” (or 
mRNA). The resulting single-stranded copy or “transcript” contains a sequence of RNA 
bases precisely matching the sequence of bases on the original DNA strand. (RNA also 
uses chemicals called bases to store genetic infonnation, but it uses a slightly different 
chemical alphabet than DNA. RNA substitutes a base called uracil for the base thymine 
used in the DNA.) 

After it is produced, the messenger-RNA molecule travels to the ribosome, a molecular 
machine that helps translate the mRNA assembly instructions. These instructions consist 
of a series of three-letter genetic “words” called “codons.” Each codon consists of three 
bases and directs the cell to attach a specific amino acid to a growing chain of other 
amino acids. For example, the mRNA word UUA directs the ribosome to attach the 
amino acid leucine, whereas AGA specifies the amino acid arginine. Other codons direct 
the ribosome to start or stop building proteins. This translation process occurs with the 
aid of specific adapter molecules (called transfer RNAs, or tRNAs) and specific enzymes 
(called aminoacyl-tRNA synthetases). (See Fig. 4.7. To view an animation of this 
process of gene expression, see this book’s Web site at www.signatureinthecell.com.) 

By the way, the elucidation of the gene-expression system shed further light on the 
experiment with the dead mice discussed in the last chapter. Oswald Avery correctly 
detennined it was DNA, not bacteria living or dead, that caused the mice to die. But 
when scientists established that DNA directs protein synthesis, Avery’s results were 
viewed in a new light. DNA caused the mice to die, all right, but it did so by directing the 
production of proteins that were toxic to the mice (though, obviously, not to the bacteria 
from which the DNA was derived). 
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Figure 4.7. A simplified schematic of gene expression showing both transcription and 
translation. (The enzymes called tRNA synthetases are not pictured, although they are 
essential to the process.) Courtesy of I. L. Cohen of New Research Publications. 



In any case, with the explication of gene expression complete, Francis Crick’s sequence 
hypothesis was formally confirmed. But what kind of information does DNA contain? 
The discovery that DNA directs protein synthesis enabled molecular biologists working 
in the two decades after the formulation of Shannon’s theory to answer this question. 



Shannon Information or Shannon Plus? 

A parable may help to illustrate a crucial distinction and help define some key terms we 
will need throughout the rest of this book. Mr. Smith has promised to pick up Mr. Jones 
at Reagan National Airport in Washington when he returns from Chicago, but he has 



asked Jones to call just after he arrives. When he steps off the plane, Jones reaches into 
his pocket for Smith’s number in order to make the call. He finds that he has lost the slip 
of paper with the number on it. Nevertheless, Jones doesn’t panic. He knows that all 
long-distance phone numbers have ten characters and are constructed from the same ten 
digits. He writes down the digits 0, 1,2, 3, 4, 5, 6, 7, 8, 9 on another piece of paper. He 
gives a sigh of relief. Now he has the raw materials, the building blocks, to generate 
Smith’s ten-digit phone number. 

He quickly arranges the digits on his list at random into a sequence of ten characters. 
Now he has some information, some Shannon information. Being mathematically 
inclined, he quickly calculates how much information he has using Shannon’s familiar 
formula. He’s relieved and impressed. His new ten-digit phone number has a lot of 
information — 33.2 bits to be exact. He calls the number. Unfortunately, he has a problem. 
The sequence of characters is not arranged “to produce a specific effect.” Instead of 
reaching Smith, he gets a recording from the phone company: “The number you have 
dialed is not in service. . .” 

After a while Mr. Smith gets worried. He hasn’t received the call from Jones that he was 
expecting. But then he remembers that he has some information that can help him reach 
Jones. He has Jones’s cell phone number. It also has ten digits. These digits are arranged 
in a definite sequence: 202-555-0973. He enters them in his own cell phone and presses 
the call button. Jones answers and tells Smith of his arrival and location. They connect, 
and there is a happy ending. 

In our illustration, both Smith and Jones have an equally improbable sequence of ten 
characters. The chance of getting either sequence at random is the same: 1/10 x 1/10 x 
1/10 x 1/10 x 1/10 x 1/10 x 1/10 x 1/10 x 1/10 x 1/10, or 1 chance in 10 10 . Both 
sequences, therefore, have information-carrying capacity, or Shannon information, and 
both have an equal amount of it as measured by Shannon’s theory. Clearly, however, 
there is an important difference between the two sequences. Smith’s number is arranged 
in a particular way so as to produce a specific effect, namely, ringing Jones’s cell phone, 
whereas Jones’s number is not. Thus, Smith’s number contains specified information or 
functional information, whereas Jones’s does not; Smith’s number has information 
content, whereas Jones’s number has only information-carrying capacity (or Shannon 
information). 

Both Smith’s and Jones’s sequences are also complex. Complex sequences exhibit an 
irregular, nonrepeating arrangement that defies expression by a general law or computer 
algorithm (an algorithm is a set of instructions for accomplishing a specific task or 
mathematical operation). The opposite of a complex sequence is a highly ordered 
sequence like ABCABCABCABC, in which the characters or constituents repeat over 
and over due to some underlying rule, algorithm, or general law. 



Infonnation theorists say that repetitive sequences are compressible, whereas complex 
sequences are not. To be compressible means a sequence can be expressed in a shorter 




form or generated by a shorter number of characters. For example, a computer 
programmer could write two commands that would generate a 300-character sequence of 
repeating ABCs simply by writing the commands “write ABC” and then “repeat 100 
times.” In this case, a simple rule or algorithm could generate this exact sequence without 
each individual character having to be specified. Complex sequences, however, cannot be 
compressed to, or expressed by, a shorter sequence or set of coding instructions. (Or 
rather, to be more precise, the complexity of a sequence reflects the extent to which it 
cannot be compressed.) 

Infonnation scientists typically equate “complexity” with “improbability,” whereas they 
regard repetitive or redundant sequences as highly probable. This makes sense. If you 
know that there is a reason that the same thing will happen over and over again, you are 
not surprised when that thing happens; nor will you regard it as improbable when it does. 
Based upon what we know about the way the solar system works, it is very probable that 
the sun will rise tomorrow. If you know that a computer is executing a command to 
“repeat” — to generate the same sequence over and over again — then it is highly probable 
that the sequence will arise. Since infonnation and improbability are inversely related, 
high-probability repeating sequences like ABCABCABCABCABCABC have very little 
information (either carrying capacity or content). And this makes sense too. Once you 
have seen the first triad of ABCs, the rest are “redundant”; they convey nothing new. 
They aren’t infonnative. Such sequences aren’t complex either. Why? A short algorithm 
or set of commands could easily generate a long sequence of repeating ABCs, making the 
sequence compressible. 

In our parable, both Smith and Jones have ten-digit numbers that are complex; neither 
sequence can be compressed (or expressed) using a simpler rule. But Smith’s sequence 
was specifically arranged to perform a function, whereas Jones’s was not. For this reason, 
Smith’s sequence exhibits what has been called specified complexity, while Jones’s 
exhibits mere complexity. The term specified complexity is, therefore, a synonym for 
specified information or information content. (See Fig. 4.8.) 








£X- ATJC J^C 
hnd wad (CW^rAO 



EX. AL#bJTZ){$C T 
RAMt'C’K POL/PtFTl&E 
w ofc >£>UHeft< . 



£X ‘TIME Aw& lit* UJAlTT R3fe W£> MAw' 
v FfMOTouAL tuA Awt> TOGTEE^ ^ 



Figure 4.8. Examples of order, complexity, and specified complexity. 



So what kind of information does DNA contain, Shannon information or specified 
information? Mere complexity or specified complexity? The answer is — both. 

First, DNA certainly does have a quantifiable amount of information-carrying capacity as 
measured by Shannon’s theory. Since DNA contains the assembly instructions for 
building proteins, the gene-expression system of the cell functions as a communication 
channel." 9 Further, the nucleotide bases function as alphabetic characters within that 
system. This enables scientists to calculate the information-carrying capacity of DNA 
using Shannon’s equations. Since, at any given site along the DNA backbone, any one of 
four nucleotide bases may occur with equal ease, the probability of the occurrence of a 
specific nucleotide at that site equals 1/4. For the occurrence of two particular nucleotide 
bases, the odds are 1/4 x 1/4. For three, 1/4 x 1/4 x 1/4, or 1/64, or (1/4), 3 and so on. 31 
The information-carrying capacity of a sequence of a specific length n can then be 
calculated using Shannon’s familiar expression (/ =-log 2 p) once one computes a 
probability value (p ) for the occurrence of a particular sequence n nucleotides long where 
p = (l/4) n . The p value thus yields a corresponding measure of information-carrying 
capacity or syntactic information for a sequence of n nucleotide bases. 

Just as mathematicians and engineers can apply Shannon’s theory to analyze a written 
text, a cryptographic transmission, or a section of software, mathematical biologists can 



apply the theory to analyze the infonnation-carrying capacity of a DNA, RNA, or protein 
molecule. That is what Charles Thaxton meant when he told me in Dallas in 1985 that the 
treatment of DNA and English text is “mathematically identical.” Both systems of 
symbols or characters can be analyzed the same way. As I write this sentence, the 
placement of each additional letter eliminates twenty-five other possible letters and a 
corresponding amount of uncertainty. It, therefore, increases the information of the 
sentence by a quantifiable amount as measured by Shannon’s theory. Similarly, at each 
site along the DNA molecule any one of the four bases is possible. Thus, the placement 
or presence of any one of the bases eliminates uncertainty and conveys a quantifiable 
amount of infonnation according to Shannon’s theory. 

Is this significant? In several ways, it is. It is certainly remarkable that DNA can carry or 
encode information using chemical subunits that function as alphabetic characters. It is 
also remarkable that DNA fonns part of a communication channel that can be analyzed 
so readily using the mathematical tools of Shannon’s infonnation theory. Further, 
scientists have applied Shannon’s information theory to generate quantitative measures of 
the infonnation-carrying capacity (or brute complexity) of DNA sequences and their 
conesponding proteins. These analyses have shown that these molecules are highly 
complex, and quantifiably so. DNA, RNA, and proteins have a tremendous capacity, at 
least, to store and transmit information. 

Nevertheless, the ease with which information theory applies to molecular biology has 
also created some confusion. The sequences of nucleotide bases in DNA and the 
sequences of amino acids in proteins are highly improbable and, therefore, have large 
information-carrying capacities. Knowing this, some scientists have mistakenly described 
DNA and proteins as if they contained only Shannon infonnation or possessed mere 
information-carrying capacity. 

Yet, like meaningful sentences, lines of computer code, or the phone number that Smith 
used to ring Jones’s cell phone, the arrangements of bases in DNA and amino acids in 
proteins are also specified with respect to function. This judgment is a direct consequence 
of the experimental verification of Francis Crick’s sequence hypothesis. Indeed, since the 
confirmation of the sequence hypothesis in the early 1960s, biologists have known that 
the ability of the cell to build functional proteins depends upon the precise sequential 
arrangement of the bases in DNA. Thus, molecular biologists beginning with Francis 
Crick have equated biological information not only with improbability (or complexity), 
but also with “specificity,” where “specificity” or “specified” has meant “necessary to 
function.” 32 

Thus, in addition to a quantifiable amount of Shannon infonnation (or complexity), DNA 
also contains infonnation in the sense of Webster’s second definition: it contains 
“alternative sequences or arrangements of something that produce a specific effect .” 
Although DNA does not convey information that is received, understood, or used by a 
conscious mind, it does have information that is received and used by the cell’s 
machinery to build the structures critical to the maintenance of life. DNA displays a 




property — functional specificity — that transcends the merely mathematical formalism of 
Shannon’s theory. 



Is this significant? In fact, it is profoundly mysterious. Apart from the molecules 
comprising the gene-expression system and machinery of the cell, sequences or structures 
exhibiting such specified complexity or specified information are not found anywhere in 
the natural — that is, the nonhuman — world. Sequences and structures exhibiting either 
redundant order or mere complexity are common in the chemical substrate of nature. But 
structures exhibiting specified complexity are completely unknown there apart from 
DNA, RNA, and proteins. As the origin-of-life biochemist Leslie Orgel observes: “Living 
organisms are distinguished by their specified complexity. Crystals... fail to qualify as 
living because they lack complexity; mixtures of random polymers fail to qualify because 
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they lack specificity.” Nevertheless, human artifacts and technology — paintings, signs, 
written text, spoken language, ancient hieroglyphics, integrated circuits, machine codes, 
computer hardware and software — exhibit specified complexity; among those, software 
and its encoded sequences of digital characters function in a way that most closely 
parallels the base sequences in DNA. 

Thus, oddly, at nearly the same time that computer scientists were beginning to develop 
machine languages, molecular biologists were discovering that living cells had been 
using something akin to machine code or software all along. To quote the 
information scientist Hubert Yockey again, “The genetic code is constructed to confront 
and solve the problems of communication and recording by the same principles 
found... in modern communication and computer codes.” Like software, the coding 
regions of DNA direct operations within a complex material system via highly variable 
and improbable, yet also precisely specified, sequences of chemical characters. How did 
these digitally encoded and specifically sequenced instructions in DNA arise? And how 
did they arise within a channel for transmitting information? 



Explaining the “Origin of’ 

As we saw in Chapter 2, early theories of the origin of life did not need to address, nor 
did they anticipate, this problem. Since scientists did not know about the 
information-bearing properties of DNA, or how the cell uses that functionally specified 
information to build proteins, they did not worry about explaining these features of life. 
But after 1953 the landscape changed irrevocably. Any idea of a repetitive, crystalline 
order at the foundation of life gave way to a picture of life in which information-rich 
macromolecules direct the metabolism and machinery of living things. Origin-of-life 
researchers have accordingly turned their attention to this crucially specified form of 
information, the origin of which constitutes the central aspect of the DNA enigma. But 
before examining how scientists have attempted to explain this mystery, it is important to 
examine another feature of the informational system at work within the cell, since there is 
another facet of the DNA enigma that scientists investigating the origin of life must now 
explain. 
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The Molecular Labyrinth 



A military surgeon at a primitive field hospital is treating a battalion commander crucial 
to the success of a coming campaign. The surgeon needs to remove shrapnel lodged in 
the soldier’s internal organs, but to do so effectively, the surgeon needs some advanced 
software and imaging technology unavailable at the field hospital. He radios his superior 
and tells him the problem. The supervisor in turn relates the situation to a bureaucrat at a 
major hospital in Washington, D.C. Unfortunately, the bureaucrat is Mr. Jones from the 
previous chapter. 

Jones quickly goes to work procuring the necessary software for running the diagnostic 
imaging technology. With it, he boards a military plane and flies to the remote location. 
He hands the surgeon a high-density storage device and raises a hand to forestall 
expressions of gratitude. “Don’t mention it,” he says. “Solving problems is what I do.” 

And yet the problem remains. It’s not the software. The information in the file is 
error-free and contains precisely the program and instructions needed to conduct the 
diagnostic procedure. The problem is the surgeon does not have either the computer or 
the operating system that would allow him to use the software. These are built into the 
diagnostic equipment — the imaging machine — that the surgeon was missing in the first 
place. The surgeon has a software file that he cannot access, translate, or use. The 
information is useless to him. The soldier dies. The war drags on. 

This story illustrates what in the realm of human technology and experience is obvious: 
encoded information is worthless without a system — whether mind or machine — that can 
use, read, translate, or otherwise process that information. What does this tell us about the 
DNA enigma? 

As scientists began to discover more about how the cell uses the information in DNA to 
build proteins, they realized that DNA is only part of a complex system for expressing 
and processing information. They realized that without this whole system, DNA by itself 
could do nothing. 

In the previous chapter, we saw that the cell uses the information stored in DNA to build 
proteins. Establishing this enabled us to answer a question about the kind of information 
that DNA contains, namely, functionally specified information. Nevertheless, the 
previous chapter did not say much about how the cell processes the information in DNA 
or how scientists came to leam about it. This is worth looking at more closely because it 
turns out that the cell’s information-processing system is itself one of the key features of 
life that any theory of the origin of life must explain. 



Unlike the unfortunate surgeon in our opening story, the cell does have a system for 
processing stored information. As for how it works: the key insight came when Francis 
Crick realized that the cell must be using a code to translate infonnation from one form to 
another and that this code was mediated by an interdependent system of molecules. This 
insight raised an age-old question: the question of the chicken and the egg. 



Deducing a Code 

By the late 1950s the relationship between DNA and protein was coming into focus. By 
then leading molecular biologists understood that the three-dimensional specificity of 
proteins depended upon the one-dimensional specificity of their amino-acid sequences. 
They also suspected that specific arrangements of amino acids in protein chains derived 
in turn from specific sequences of nucleotide bases on the DNA molecule. Yet the 
question remained: How does the sequence of bases on the DNA direct the construction 
of protein molecules? How do specific sequences in a four-character alphabet generate 
specific sequences in a twenty-character alphabet? 

Francis Crick again anticipated the answer: the cell is using some kind of a code. Crick 
first began to suspect this as he reflected on a proposal by George Gamow, a 
Russian-born theoretical physicist and cosmologist who had, in the postwar years, turned 
some of his prodigious intellectual powers to reflect on new discoveries in molecular 
biology. 

Gamow had immigrated to the United States in the 1930s to take an appointment at 
George Washington University after working at the famed Theoretical Physics Institute 
in Copenhagen. In 1953 and 1954, he proposed a model to explain how the specific 
sequences in DNA generate the specific sequences in proteins. According to Gamow’s 
“direct template model,” as it was called, protein assembly occurred directly on the DNA 
strand. Gamow proposed that proteins fonned as amino acids attached directly to the 
DNA molecule at regularly spaced intervals. Gamow thought the amino acids could 
nestle into diamond-shaped cavities in DNA that formed in the space between the two 
backbones (see Fig. 5.1). In this model, a group of four nucleotide bases from two 
parallel strands of DNA made a specifically shaped hollow into which one and only one 
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amino acid could fit. As each group of nucleotides acquired an amino-acid partner, the 
amino acids would also link with each other to form a chain. According to Gamow, the 
nearly identical spacing of amino acids in protein and of bases in DNA enabled the direct 
“matching” of amino acids to nucleotide groups on the DNA template. This matching 
occurred because of the supposed fit between the cavities produced by the bases and the 
shape of the side chain of the amino acids and the chemical affinity between the bases 
and amino acids. 





Figure 5.1. A simplified version of George Gamow’s own drawing depicting his direct 
template model of protein synthesis. The diamonds represent places where he thought 
various twenty amino acids would bond to the DNA double helix. The circles represent 
one of the four nucleotide bases. Gamow thought that the amino acids would nestle into 
the grooves between the nucleotide bases and that the structure of the DNA molecule 
would somehow determine where individual amino acids would bond, thereby directly 
determining protein sequencing. 



Francis Crick first recognized the futility of this scheme. In a famous communication to 
members of the “RNA Tie Club,” Crick explained that there was nothing about either the 
chemical properties or shapes of the bases to ensure that one and only one amino acid 
would fit into, or attach to, the cavities created by a group of bases. 4 In the first place, 
many of the amino acids were difficult to differentiate structurally, because they had 
similar side chains. Second, the bases themselves did not necessarily create shapes that 
either matched these shapes or, still less, differentiated one from another. As Crick put it, 
“Where are the nobby hydrophobic surfaces to distinguish valine from leucine from 
isoleucine? Where are the charged groups, in specific positions, to go with the acidic and 
basic amino acids?” 5 As Judson explains, “Crick was a protein crystallographer, and 
knew of no reason to think that Gamow’s holes in the helix could provide the variety or 
precision of shapes necessary to differentiate a score or more of rather similar objects.” 6 

Y et in the absence of such spatial matching and the formation of corresponding chemical 
attachments, there could be no reliable transmission of information. For the direct 
template model to explain the irregularity and specificity of the amino-acid sequences in 
proteins, individual bases (or groups of bases) needed to manifest discriminating spatial 
geometries. Figuratively speaking, DNA bases not only needed to “zig” in a way that 
matched the specific “zags” of each amino acid, but DNA needed to do so at irregular 
intervals in order to produce the irregular sequencing of amino acids that characterizes 
proteins. Yet as Crick realized, both the individual bases themselves and their various 
combinations lacked distinguishing physical features that could account for the 
specificity of amino-acid sequencing. Further, the geometry of the DNA molecule as a 
whole, at the level of its gross morphology, presents a highly repetitive series of major 
and minor grooves (see Fig. 5.2). Therefore, it could not function as a direct template for 
protein synthesis. As Crick explained, “What the DNA structure does show... is a specific 
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pattern of hydrogen bonds, and very little else.” 
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Figure 5.2. The large-scale form of the DNA helix showing its major and minor grooves. 



If the chemical features and shapes of the DNA bases do not directly account for the 
specific sequencing of proteins, what does? Crick remained adamant that the specific 
arrangement of the nucleotide bases, not anything about their physical or chemical 
features per se, dictated amino acid sequencing. 8 

Crick’s insight had profound implications. If a single protein could not copy the 
information in DNA directly, as the direct template model suggested, then, as Jacques 
Monod would later explain, “you absolutely needed a code.” 9 And so Crick postulated a 
third factor consistent with his original sequence hypothesis. He proposed the existence 
of a genetic code — a means of translating information from one chemical domain into 
another. 

To envision what Crick had in mind, imagine having a human alphabet that uses four and 
only four distinct shapes that combine in various specific ways to form not just words, 
but words that correspond to individual letters in a larger alphabet roughly the size of the 
English alphabet; this larger alphabet then uses its letters (each of which is one of those 
words composed of the four shapes) to build sentences. Of course, we actually have a 
symbol system that does much the same thing. The binary code that computer programs 
use has a translation key that enables programmers to produce English text from 
sequences of binary digits. Each letter in the English alphabet is represented by a unique 
combination of two character types, 0’s and l’s. For example, in ASCII code the letter A 
is represented by the sequence 100 0001, the letter B by the sequence 100 0010, and so 
on. Each of the letters of the twenty-six-letter English alphabet has a corresponding 
representation in the two-digit numeric alphabet of this binary system (see Fig. 5.3). 1 
Crick realized that if his sequence hypothesis was true, then there must be some similar 
translation system in the cell — one that determined how sequences written in the 
four-character alphabet of DNA are converted into sequences that use a twenty-letter 
amino-acid alphabet. The DNA detective was now in the business of code breaking as 
well. 
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Figure 5.3. Part of the ASCII code. 



Yet in a physical system, as opposed to a social or linguistic one, a code must have a 
physical expression. Crick postulated the existence of a third molecule, an adapter 
molecule functioning as a translating device that could recognize and convert the 
information in the sequential arrangements of the bases into specific amino-acid 
sequences. 11 More specifically, he proposed the existence of twenty separate adapter 
molecules corresponding to each of the twenty protein-forming amino acids. Each 
adapter would, by the familiar mechanism of complementary base pairing, bind to a 
section of DNA text at one end and to a specific amino acid at the other. Crick also 
proposed the existence of specific enzymes (one for each of the twenty adapter-amino 
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acid pairs) to connect the specific amino acids and their corresponding adapters. The 
set of correspondences between sections of genetic text, on the one hand, and a specific 
amino acid, on the other, constituted a genetic code (see Fig. 4.6). 

Though these correspondences were mediated physically by adapter molecules and 
enzymes, this complex system, as conceived by Crick, would be governed as much by the 
functional requirements of information transfer as by rules of chemical affinity — as much 
by a set of chemically arbitrary conventions as by the necessary relations of 
physical-chemical law. Indeed, as Crick imagined this system, nothing about the physical 
or chemical features of the nucleotides or amino acids directly dictated any particular set 
of assignments between amino acids and bases in the DNA text. The code could not be 
deduced from the chemical properties of amino acids and nucleotide bases. It had to be 
cracked. Just as a specific letter of the English language can be represented by any 
combination of binary digits, so too could a given amino acid correspond to any 



combination of nucleotide bases. The assignments are, in both cases, arbitrary. For this 
reason, the progressive elimination of the many chemically possible types of codes would 
eventually prove a laborious task. Yet precisely this feature of chemical 
underdetermination would also later prove the key to understanding DNA’s 
information-carrying capacity. As Judson explains, “Crick freed the nucleic acid template 
of its most stringent physical limitation, allowing it to be thought about formally” as an 
information carrier rather than merely a chemical substance. 

Crick’s proposal was striking in its shear theoretical audacity. Biochemistry had not a 
shred of direct evidence for the existence of adapter molecules or their corresponding 
enzymes. Crick simply deduced the need for a code by thinking about what would be 
needed to make the cell’s communication system work. Only a code could facilitate the 
translation of the information from DNA’s four-character base sequences into the 
twenty-character “language” of proteins. His adapter hypothesis followed logically, and 
intuitively, from an understanding of the functional requirements of infonnation transfer 
and the limited informational capacity of the chemical constituents of the relevant 
molecules themselves. 

Simple regular geometries plus rules of chemical affinity did not and could not generate 
the specific complexity present in functional proteins. But complex combinations of a 
few physically simple bases functioning as characters or symbols could specify a vast 
array of possible structures, just as only two symbols, when variously arranged, can 
specify many characters by means of the ASCII code. Yet if the sequential 
arrangement — and nothing else — about the bases dictated protein structure, then the cell 
needed a means of translating and expressing the infonnation stored on DNA. The cell 
needed a code, albeit one physically instantiated as part of an integrated system for 
translating and expressing genetic infonnation. And so Crick conceived the “adapter 
hypothesis” — described by Judson as “the postulation, from theoretical necessity, of a 
new biochemical entity.” 14 

What Crick would postulate on the grounds of functional necessity took nearly five years 
of intensive research and many transatlantic communications and conferences to verify 
and elucidate. But during the late 1950s and early 1960s, a series of experiments enabled 
scientists to show how the infonnation in DNA directs protein synthesis. First, Paul 
Zamecnik, a researcher at Massachusetts General Hospital, discovered that proteins were 
produced in cellular structures called ribosomes, located in the outer cytoplasm “far” 
from the nucleus where DNA resides. 15 This discovery suggested the need for 
mechanisms to transcribe, transport, and translate the information in DNA so that 
amino-acid chains could be constructed at these sites. Because Zamecnik had discovered 
that ribosomes contained no DNA, Francis Crick, along with two other molecular 
biologists, Francois Jacob and Sydney Brenner, suggested that another type of molecule, 
a copy of the original DNA gene, actually directed the synthesis of proteins at the 
ribosomes. They proposed that this “messenger” molecule was made of RNA, or 
ribonucleic acid. Soon Brenner and others found this messenger RNA operating as he, 
Crick, and Jacob had anticipated. 16 




About the same time, Severo Ochoa, a Spanish-born physician working at the New York 
University Medical School, and Marshall Nirenburg, an American at the National 
Institutes of Health in Maryland, performed experiments that enabled scientists to 
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decipher the genetic code. Ochoa identified an enzyme that enabled him to synthesize 
RNA molecules. Nirenburg used Ochoa’s technique to make a synthetic form of 
messenger RNA that he then used to direct protein synthesis. 18 Together their work 
showed that groups of three nucleotides (called codons) on the mRNA specify the 
addition of one of the twenty protein-forming amino acids during the process of protein 
synthesis. Other scientists discovered that the cell uses a set of adapter molecules to help 
convert the information on mRNA into proteins just as Crick expected. 19 Indeed, Crick’s 
adapter molecules and their corresponding enzymes functioned much as he had originally 
envisioned, albeit as part of a far more complex process than even he had foreseen. 



“CAD-CAM” and Protein Synthesis 

In April 2008, a film called Expelled: No Intelligence Allowed, starring the Hollywood 
comic actor Ben Stein, was released in theaters all over the United States. Part of the film 
was about the current scientific controversy over the origin of life. When the producers 
came to our offices to plan interviews, they told us they wanted to find a way to represent 
what DNA does visually, so that a general audience could follow the scientific discussion 
they planned to incorporate into the film. They commissioned a visually stunning 
three-dimensional animation of DNA and the inner workings of the cell and retained a 
team of molecular biologists to work closely with the animators. When the film opened in 
theaters, audiences were impressed by the beauty of the animation and the intricacy of the 
processes that it depicted. The producers have kindly made some of their animation, 
including some that didn’t make it into the film, available to me at 
www.signatureinthecell.com. This animation beautifully depicts the process by which the 
information in DNA directs the synthesis of proteins. I recommend viewing it in tandem 
with the discussion that follows. 

Though the animation in Expelled provided a visual representation of how the cell uses 
information to direct its manufacturing operations, the film producers thought it might 
also help audiences understand the process if they could see footage of an analogous 
manufacturing procedure used in many industries. “Computer-assisted design and 
manufacturing,” or “CAD-CAM” for short, uses digital information to manufacture 
various machines and products, from airplanes to automobiles to garage doors. 20 

At the Boeing plant in Seattle, engineers use CAD-CAM to direct the production of many 
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key parts of airplanes. A CAD program is used to design some part of an airplane, such 
as a wing. Engineers provide specifications that enable the CAD program to produce a 
three-dimensional drawing, a visual display of the specific part. The engineers can 
examine this visual image to check that they have achieved their design objectives. The 
CAD program also stores the infonnation for producing this visual display in binary 
code. In the CAD-CAM process, the digital information stored in the CAD program is 
then transferred to another computer program, called an NC (“numerical code”) interface. 




This program then translates the instructions from the CAD program into a machine code. 
This machine code then directs the manufacturing machinery — a robotic rivet arm, for 
example — to make the parts of the airplane. 

In the film, the producers showed a manufacturing plant operating with CAD-CAM 
technology to illustrate how the cell uses digital code to manufacture its proteins and 
protein machines. In fact, the similarities between the two processes are striking. Like a 
production facility at Boeing or Ford, the cell uses digitally encoded information to direct 
the manufacture of the parts of its machines. In addition, the process of gene expression 
involves the conversion of information from one digital format to another before the 
information is used in manufacturing. In CAD-CAM, the original digitally encoded 
information in the CAD program is translated into another machine code by the NC 
interface and then used to direct the manufacture of airplane parts. In the gene-expression 
system, the original digital information in DNA is converted into an RNA format, which 
then directs the construction of proteins. 

The system is a marvel of sophistication and merits a closer look. What follows is a short 
tour of the molecular labyrinth, the cell’s sophisticated infonnation-processing system. 
This will help to clarify what origin-of-life researchers have to explain. For those without 
a background in biology, don’t worry if some of the details get past you. The details of 
the process are fascinating, but the big picture that emerges from this overview is the key. 
The cell’s information-processing system is strikingly similar to CAD-CAM technology, 
though it differs from CAD-CAM in at least one important respect: the cell’s 
information-processing system not only produces machines, it also reproduces itself. 



Gene Expression 

Molecular biologists describe the process of protein synthesis, or “gene expression,” as a 
two-stage process of information transfer involving many smaller discrete steps and 
many molecular machines. This process proceeds as long chains of nucleotide triplets 
(the genetic message) are first copied during a process known as “transcription” and then 
transported (by the molecular messenger mRNA) to a complex organelle called a 
ribosome. At the ribosome site, the genetic message is then “translated” with the aid of a 
suite of adapter molecules called transfer RNAs to produce growing amino-acid 
chains — chains that fold into the functional proteins the cell needs to survive. Let’s look 
at each of the two stages of gene expression in turn. 



Transcription 

The first stage in the process of protein synthesis is called transcription. During 
transcription, a copy, or transcript, of the DNA text is made by a large protein complex, 
known as RNA polymerase, that moves down the DNA chain and “reads” the original 
DNA text. As RNA polymerase proceeds, it makes an identical copy of the DNA 
transcript in an RNA format. (Like DNA, RNA contains four chemical bases, called 




nucleotide bases. These bases are the same as those in DNA with one exception: RNA 
uses a base called uracil instead of thymine.) The resulting single-stranded RNA copy, or 
transcript, then moves from the chromosomes to the ribosome in the outer cytoplasm to 
begin translation, the next step in the processing of genetic information. (See Fig. 5.4.) 

Transcription can be thus described in a few simple sentences. Yet any such description 
conceals an impressive complexity. In the first place, RNA polymerase is an 
extraordinarily complex protein machine of great specificity. The RNA polymerases 
present in the simplest bacteria (. Mycoplasma ) comprise several separate protein subunits 
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with (collectively) thousands of specifically sequenced amino acids. 

RNA polymerase performs several discrete functions in the process of transcription. First, 
it recognizes (and binds to) specific regions of the DNA that mark the beginning of 
genes. Second, it unwinds (or helps unwind) the DNA text, exposing the strand that will 
serve as the template for making the RNA copy. Third, it sequesters and positions RNA 
bases (A, U, G, C) with their complementary partners on the DNA template (T, A, C, G, 
respectively). Fourth, it polymerizes or links together the separate RNA nucleotides, thus 
forming a long message-bearing ribbon of mRNA. As molecular biologist Stephen 
Wolfe explains: “The structure of the RNA polymerases reflects the complexity of their 
activities in RNA transcription. The enzymes have sites that recognize promoters, react 
with initiation, elongation and tennination factors, recognize DNA bases for correct 
pairing, bind and hydrolyze RNA nucleotides, form phospho-diester linkages, tenninate 
transcription and perhaps unwind and rewind DNA.” 2 
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Figure 5.4. The process of transcription. The top view shows a view from outside the 
RNA polymerase. The bottom view is a close-up showing (in simplified form) what is 
happening inside the RNA polymerase. 



Yet for all its complexity and specificity, RNA polymerase alone does not ensure 
accurate transcription. The process involves several other complex and functionally 
integrated parts and steps. For example, for RNA polymerase to access the genetic text, 
the DNA double helix must unwind and expose the nucleotide bases. Further, to initiate 
transcription, the RNA polymerase must bind to the correct part of the DNA sequence in 
order to begin transcribing at the beginning of a genetic message, rather than in the 
middle or at the end. For its part, the DNA text provides a promoter sequence or signal 
upstream of the actual coding sequence to facilitate recognition of the correct location by 



the RNA polymerase. Yet the RNA polymerase cannot, on its own, find this site with any 
reliability. In prokaryotes (cells without nuclei), a protein known as sigma factor 
combines with the core RNA polymerase enzyme (itself a composite of enzymes) to form 
a larger “holoenzyme.” The addition of this sigma-factor protein increases the accuracy 
of RNA polymerase-DNA binding by roughly a million times, making recognition of 
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promoter sequences and thus accurate transcription possible. 

Transcription is a highly regulated process. By binding to specific sites on the DNA, 
various proteins will either inhibit or promote the transcription of particular genes in 
response to the varying needs of the cell. For example, when bacteria have no lactose 
available to metabolize, the protein “lactose repressor” binds near the gene that produces 
proteins for consuming or metabolizing lactose. This prevents RNA polymerase from 
transcribing the gene and producing an unnecessary protein. When lactose enters the cell, 
however, a chemical derivative of lactose binds to the lactose repressor, producing a 
change in the repressor protein that causes the repressor to fall off the gene, allowing the 
gene to be transcribed and the protein for metabolizing lactose to be synthesized. By 
regulating transcription, repressor and activator proteins ensure that the cell maintains 
appropriate levels of proteins. Thus, even in prokaryotic organisms, many separate 
proteins are necessary to facilitate — and regulate — transcription. 
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Figure 5.5. Parts of the RNA polymerase holoenzyme (sigma factor not pictured). 



In eukaryotic cells (cells with nuclei), the process of transcription is considerably more 
complex. 28 Here recognition is promoted by a massive complex of several necessary 



initiation factors, enzymes that, like a jigsaw puzzle, fit together with each other, with the 
promoter sequence on the DNA molecule, and with the RNA polymerase (see Fig. 5.5). 
After transcription, the mRNA transcript must be heavily edited by still other proteins 
before transport to the ribosome site for translation. (In prokaryotes, this editing process 
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takes place to a much lesser, but still necessary degree; see Fig. 5.6). 

As it turns out, the original DNA text in eukaryotic organisms has long sections of text 
called “introns” that do not (typically) encode proteins. Although these introns were once 
thought to be nonfunctional “junk DNA,” they are now known to play many important 
functional roles in the cell. 30 (I discuss some of these newly discovered functions of 
introns and other types of nonprotein coding DNA in Chapter 18 and the Epilogue.) In 
any case, the initial transcript, being a copy of the DNA, also contains sections of text 
that do not encode proteins, but that are interspersed with coding text. To excise these 
noncoding regions before translation, the cell must edit the initial mRNA transcript so 
that only coding regions remain. This process requires the existence of other specific 
enzymes — exonucleases, endonucleases, and splicesomes, for example — that can 
correctly identify and excise the nonprotein coding text from the initial RNA transcript 
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and then splice together the resulting fragments of coding text in the correct order. 
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Figure 5.6. The initiation phase of transcription showing a promoter sequence (TATA), 
the various transcription factors that help position the RNA polymerase correctly along 
the DNA and the RNA polymerase enzyme. 



Thus, in both prokaryotes and eukaryotes, transcription constitutes a complex, 
functionally integrated process involving several specialized and individually necessary 
proteins. Yet production of each of these separate proteins is itself dependent on the very 



process of transcription that they make possible. To build RNA polymerase, for example, 
the cell must transcribe the genetic texts with the instructions for building RNA 
polymerase. Yet to transcribe this infonnation requires RNA polymerase. The 
information necessary to manufacture RNA polymerase and all the other associated 
enzymes and protein cofactors of transcription is stored on the DNA template. But 
expressing that information on the DNA template for building the proteins of the 
transcription system requires most of the proteins of the transcription system. 



Translation 

The next step in gene expression, called translation, exhibits even greater integrated 
complexity. Whereas transcription makes a single-stranded copy — a transcript — of DNA 
in an RNA format, translation uses that infonnation to build a protein. Since many 
biologists think of protein molecules themselves as information-rich molecules 
constructed from a twenty-character amino-acid alphabet, they think of the process of 
protein synthesis as a process of translating information from the four-character alphabets 
of DNA and RNA into the twenty-character amino-acid alphabet; hence the name 
“translation.” 

Even in the simplest prokaryotic cells, the process of translation utilizes many dozens of 
separate proteins or protein machines, each one of which is produced during 
translation. After the messenger RNA reaches the outer cytoplasm, it arrives at the site 
of a large chemical-processing unit known as a ribosome, the site of protein synthesis. In 
prokaryotes, the ribosome alone contains fifty separate proteins and three long RNA 
molecules combined in two distinct but associated subunits. 

The process of translation begins as the ribosome subunits dissociate and the messenger 

RNA (mRNA) binds to the smaller of the two subunits (see Fig. 5.7). Auxiliary proteins 

known as initiation factors catalyze this disassociation and temporarily stabilize the 

second subunit in its disassociated state. At the same time, a group of three RNA bases on 

a transfer-RNA (tRNA) molecule binds to the first triplet of RNA bases on the mRNA 

molecule as it docks in the ribosome. The groups of three bases on mRNA are called 

codons or triplets. The groups of three bases to which they bind on the tRNA are called 

anticodons. The sequence AUG constitutes the “initiator codon” at the head of the mRNA 
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transcript. 

After the initiator codon (AUG) on the mRNA transcript binds to the anticodon triplet on 
the corresponding tRNA, then the second and larger subunit of the ribosome rejoins the 
first, forming a large complex of molecules including both ribosomal subunits, the 
mRNA, and a tRNA molecule carrying its corresponding amino acid. The protein chain 
can now begin to form. An additional amino acid-tRNA combination (known as an 
aminoacyl-tRNA molecule) binds to a second and adjacent active site on the ribosome, 
bringing its amino acid into close proximity to the first. A protein within the ribosome 
known as a peptidyl transferase then catalyzes a polymerization (linking) reaction 
involving the two (tRNA-borne) amino acids. In the process, the first amino acid 




detaches from its tRNA and attaches to the second amino acid, fonning a short dipeptide 
chain. The ribosome then ejects the first and empty molecule and moves along to “read” 
the next triplet of bases on the mRNA. Another tRNA-amino acid carrier pairs with the 
next mRNA codon, bringing a new amino acid into close proximity with the growing 
chain, and the process repeats itself until the signal for termination is reached on the 
mRNA. Then a protein termination factor, rather than an aminoacyl tRNA, binds to the 
second ribosome site and catalyzes hydrolysis (dissolution) of the bond holding the 
peptide chain to the tRNA at the first ribosome site. The newly assembled protein then 
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detaches. 
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Figure 5.7. The translation of messenger RNA at the ribosome resulting in a growing 
amino acid chain. 



At each step in the translation process, specialized proteins perfonn crucial functions. For 
example, the initial coupling of specific amino acids to their specific tRNA molecules 
(Crick’s adapters) depends upon the catalytic action of twenty specific enzymes, one for 
each tRNA-amino acid pair. The integrity of the genetic code depends upon the specific 
properties of these enzymes, kn own as aminoacyl-tRNA synthetases. 36 

These synthetases are necessary because, as Francis Crick anticipated, there is nothing 
about the chemical properties of the bases in DNA (or those in mRNA) that favors 
forming a chemical bond with any specific amino acid over another. In fact, the 
cloverleaf-shaped tRNA molecule attaches to the mRNA transcript on one end and 
carries a specific amino acid on the other. Figure 5.8 shows that the amino acid and the 
codon-anticodon pairs are at opposite ends of the tRNA molecule. This distance ensures 
that neither the codons on mRNA nor the anticodons on tRNA interact chemically with 
the amino acids. As Crick anticipated, direct chemical interactions between bases 



(codons) and amino acids do not determine the assignments that constitute the genetic 
code. 







Figure 5.8. The transfer-RNA molecule showing the anticodon on one end of the 
molecule and the amino acid attachment site on the other. 

Instead, these associations are mediated indirectly by the enzymatic action of the 
aminoacyl-tRNA synthetases. The synthetases have several active sites that enable them 
to: (1) recognize a specific amino acid, (2) recognize a specific corresponding tRNA 
(with a specific anticodon), (3) react the amino acid with ATP (adenosine triphosphate) to 
form an AMP (adenosine monophosphate) derivative, and then, finally, (4) link the 
specific tRNA molecule in question to its corresponding amino acid. Current research 
suggests that the synthetases recognize particular three-dimensional or chemical features 
(such as methylated bases) of the tRNA molecule. In virtue of the specificity of the 
features they must recognize, individual synthetases have highly distinctive shapes that 
derive from specifically arranged amino-acid sequences. In other words, the synthetases 
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are themselves marvels of specificity. 

For their part, ribosomes must also perform many functions. These include: (1) enhancing 
the accuracy of codon-anticodon pairing between the mRNA transcript and the 
aminoacyl-tRNAs, (2) polymerizing (via peptidyl transferase) the growing peptide chain, 
(3) acting as energy transducers converting chemical energy into the mechanical energy 
during translocation of amino acids from tRNA carriers, (4) protecting the growing 
protein from attack by proteases (protein-degrading enzymes) possibly by forming a long 



protective tunnel, and (5) assisting in the hydrolysis (dissolution) of the amino 
acid-tRNA bond during termination. Further, several separate protein factors and 
cofactors facilitate various specialized chemical transformations during the three discrete 
steps of translation: initiation, elongation, and tennination. In eukaryotes, initiating 
translation alone requires a dozen separate protein cofactors. In prokaryotes, for each of 
the three steps of translation, three specialized protein cofactors perform specific (and in 
several cases necessary) functions. Thus, here, as in the transcription system, 
origin-of-life researchers find themselves confronted by a chicken-egg problem. 



Making Copies 

Besides transcribing and translating, the cell’s information-processing system also 
replicates DNA. This happens whenever cells divide and copy themselves. As with the 
processes of transcription and translation, the process of DNA replication depends on 
many separate protein catalysts to unwind, stabilize, copy, edit, and rewind the original 
DNA message. In prokaryotic cells, DNA replication involves more than thirty 
specialized proteins to perform tasks necessary for building and accurately copying the 
genetic molecule. These specialized proteins include DNA polymerases, primases, 
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helicases, topoisomerases, DNA-binding proteins, DNA ligases, and editing enzymes. 
DNA needs these proteins to copy the genetic infonnation contained in DNA. But the 
proteins that copy the genetic infonnation in DNA are themselves built from that 
information. This again poses what is, at the very least, a curiosity: the production of 
proteins requires DNA, but the production of DNA requires proteins. 

To complicate matters further, proteins must catalyze formation of the basic building 
blocks of cellular life such as sugars, lipids, glycolipids, nucleotides, and ATP (adenosine 
triphosphate, the main energy molecule of the cell). Yet each of these materials is also 
constructed with the help of specific enzymes. For example, each of the systems involved 
in the processing of genetic infonnation requires energy at many discrete steps. In the 
cell, ATP (adenosine triphosphate) or a similar molecule (GTP, guanosine triphosphate) 
supplies this energy whenever one of its three phosphates are cleaved during a reaction 
known as “hydrolysis.” The cell manufactures ATP from glucose by a process known as 
glycolysis. Yet glycolysis involves ten discrete steps each catalyzed by a specific protein. 
These proteins (e.g., hexokinase, aldolase, enolase, pyruvate kinase) are, in turn, 
produced from genetic infonnation on DNA via the processes of transcription and 
translation. Thus, the infonnation-processing system of the cell requires ATP, but ATP 
production (via glycolysis) requires the cell’s information-processing system, again 
forming a “closed loop.” Indeed, it even takes ATP to make ATP during glycolysis. 
Figure 5.9 shows some of the functionally interdependent relationships among the 
systems of proteins that comprise the cell’s information-processing system. 





Figure 5.9. Schematic of the functional integration of the prokaryotic 
information-processing system listing the number of proteins involved in each process. 



Integrated Complexity and the Origin of Life 

Following the elucidation of the structure and function of DNA during the 1950s and 
early 1960s, a radically new conception of life began to emerge. Not only did molecular 
biologists discover that DNA carried information; they soon began to suspect that living 
organisms must contain systems for processing genetic information. Just as the digital 
information stored on a disc is useless without a device for reading the disc, so too is the 
information on DNA useless without the cell’s information-processing system. As 
Richard Lewontin notes, “No living molecule [i.e., biomolecule] is self-reproducing. 
Only whole cells may contain all the necessary machinery for self-reproduction.... Not 
only is DNA incapable of making copies of itself, aided or unaided, but it is incapable of 
‘making’ anything else.... The proteins of the cell are made from other proteins, and 
without that protein-forming machinery nothing can be made.” 40 

Crick was right: the cell contains not just molecular repositories of genetic information, 
but a code for translating the information in the DNA molecule (and its RNA transcript) 
into the construction of a protein. But this requires some physical medium of information 
transfer. After Crick and others realized that this transfer is not achieved via the direct 
attraction of amino acids to individual nucleotide bases or groups of bases — as Gamow 
had proposed — it became evident that the transcription and translation of genetic 
information is mediated by a complex information-processing system composed of many 
types of nucleic acids (such as mRNAs and tRNAs) and many specific enzymes. 



These and other developments in molecular biology since the 1960s have shown that the 
information-processing system of the cell depends on a “tightly integrated” system of 
components — indeed, a system of systems. Both the transcription and translation systems 
depend upon numerous proteins, many of which are jointly necessary for protein 
synthesis to occur at all. Yet all of these proteins are made by this very process. Proteins 
involved in transcription such as RNA polymerases, for example, are built from 
instructions carried on an RNA transcript. Translation of the RNA transcript depends 
upon other specialized enzymes such as synthetases, yet the information to build these 
enzymes is translated during the translation process that synthetases themselves facilitate. 

Biochemist David Goodsell describes the problem, “The key molecular process that 
makes modem life possible is protein synthesis, since proteins are used in nearly every 
aspect of living. The synthesis of proteins requires a tightly integrated sequence of 
reactions, most of which are themselves perfonned by proteins.” 41 Or as Jacques Monod 
noted in 1971: “The code is meaningless unless translated. The modern cell’s translating 
machinery consists of at least fifty macromolecular components which are themselves 
coded in DNA: the code cannot be translated otherwise than by products of 
translation.’’’’ (Scientists now know that translation actually requires more than a 
hundred proteins.) 43 

The integrated complexity of the cell’s information-processing system has prompted 
some profound reflection. As Lewontin asks, “What makes the proteins that are necessary 
to make the protein?” 44 As David Goodsell puts it, this “is one of the unanswered riddles 
of biochemistry: which came first, proteins or protein synthesis? If proteins are needed to 
make proteins, how did the whole thing get started?” 45 The end result of protein 
synthesis is required before it can begin. 

The interdependence of proteins and nucleic acids raises many obvious “chicken and 
egg” dilemmas — dilemmas that origin-of-life theorists before the 1960s neither 
anticipated nor addressed. The cell needs proteins to process and express the information 
in DNA in order to build proteins. But the construction of DNA molecules (during the 
process of DNA replication) also requires proteins. So which came first, the chicken 
(nucleic acids) or the egg (proteins)? If proteins must have arisen first, then how did they 
do so, since all extant cells construct proteins from the assembly instructions in DNA. 
How did either arise without the other? 

As the late British philosopher Sir Karl Popper mused, “What makes the origin of life and 
the genetic code a disturbing riddle is this: the code cannot be translated except by using 
certain products of its translation. This constitutes a really baffling circle: a vicious circle 
it seems, for any attempt to form a model, or a theory, of the genesis of the genetic 
code.” 46 Even the simplest living things operate as complex systems of multiple 
interdependent parts. Yet how this feature of life could have arisen is, as Jacques Monod 
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put it, “exceedingly difficult to imagine.” 




The Question Redefined 



The picture of the cell provided by modern molecular biology has led scientists to 
redefine the question of the origin of life. The discovery of life’s information-processing 
systems, with their elaborate functional integration of proteins and nucleic acids, has 
made it clear that scientists investigating the origin of life must now explain the origin of 
at least three key features of life. First, they must explain the origin of the system for 
storing and encoding digital information in the cell, DNA’s capacity to store digitally 
encoded information. Second, they must explain the origin of the large amount of 
specified complexity or functionally specified information in DNA. Third, they must 
explain the origin of the integrated complexity — the functional interdependence of 
parts — of the cell’s information-processing system. 

Later chapters in the book (8 through 16) describe the various attempts scientists have 
made to explain the origin of the information and information-processing system that the 
cell requires to maintain itself. As we will see, there have been three basic approaches for 
explaining the informational properties of life. The first relies heavily on random 
molecular interactions (chance). The second explains by reference to lawlike forces 
(necessity). The third approach combines chance and necessity. But before I investigated 
these possible explanations in detail I studied more about what scientists in the past 
thought about questions of origins and about how scientists in the present investigate 
them. I did this in part because I wanted to know if there was a fourth option — the design 
hypothesis — that could be legitimately considered as a possible scientific explanation for 
the DNA enigma. 
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The Origin of Science and the Possibility of Design 



At the beginning of my graduate studies, I did not know the extent of the cell’s 
complexity and, therefore, did not fully appreciate the challenge facing origin-of-life 
research. But as I was learning more about various origin-of-life scenarios, I also began 
to learn everything I could about molecular biology and the extraordinary discoveries that 
had taken place in that field during the preceding thirty-five years. Fortunately, the 
Cambridge system allowed graduate students to attend lectures in various disciplines, and 
our supervisors in the history and philosophy of science particularly encouraged us to 
attend lectures in scientific fields relevant to our topics. I found the challenge of retooling 
in molecular biology exciting and was further aided in my quest by a judicious choice of 
new friends, several of whom were molecular biologists working at the Medical Research 
Council Laboratory, including two American postdoctoral researchers who shared my 
interest in the origin of life and who more or less willingly indulged my persistent 
inquiries about the inner workings of the cell. 

As I learned more about the chemistry of DNA, the sequence hypothesis, the 
gene-expression system, and what molecular biology generally had to say about the 
origin-of-life problem, I also learned more about how Watson and Crick had made their 
famous discovery. One day, Maurice Wilkins, the crystallographer who had received the 
Nobel Prize along with Watson and Crick for his work on the double helix, came to 
lecture in our department. In his talk, Wilkins told about his sometimes tense working 
relationship with Rosalind Franklin, the importance of her work, his encounters with 
Watson and Crick, and the key scientific insights that led to their eventual discovery. He 
was personally modest about the part he played in this historical achievement and clearly 
energized by the opportunity to relive the excitement of the discovery, although he did 
comment, rather plaintively, that if only he had had a bit more time, he might himself 
have solved the structure first. 

Afterward, several scholars questioned Wilkins about the ethics of Watson and Crick’s 
use of what they had learned from Wilkins about Franklin’s X-ray images in order to 
determine the structure of DNA. Wilkins’s response was interesting. He acknowledged 
that Franklin, who died before the Nobel Prize was awarded, had not received her share 
of credit for the important discovery, but he defended his rivals from Cambridge for their 
willingness and ability to synthesize evidence from diverse sources and disciplines in 
order to put all the pieces of a puzzle together into a coherent new picture of reality. 

I had a chance to talk with Wilkins afterward and his defense of Watson and Crick left 
me with a key insight about how science works. Many people think that scientists spend 



all their time doing experiments in the laboratory in order to test hypotheses. The 
discovery of the structure of DNA showed that science certainly involves careful 
laboratory work. Rosalind Franklin’s painstaking collection and analysis of X-ray data 
played an indispensable role in the discovery of the double helix, one for which she 
doubtless did not receive sufficient credit. But the discovery of the double helix showed 
that science also depends upon the synthesis of diverse lines of evidence in order to make 
sense of the big picture. In this, Watson and Crick excelled. Seen in this light, there was 
more method — more scientific method — to their mad dash to collect and synthesize all 
available data than some of their critics have allowed. 

Though Watson and Crick were relatively unknown and certainly undercredentialed, they 
had solved one of the great scientific mysteries of the ages. Moreover, they achieved this 
feat not by working their way up through the establishment, which typically involves 
publishing a series of narrowly focused technical papers based on their own experimental 
research, but by explaining an array of preexisting evidence in a new and more coherent 
way. Could a similar approach help to crack the other mystery surrounding DNA, the 
mystery of how the digital infonnation in the molecule had originated in the first place? 

As I addressed these questions I found Watson and Crick’s example instructive and 
encouraging. I was not an experimentalist, but a former applied scientist and philosopher 
of science. In my investigations of the DNA enigma, I began to marshal every relevant 
intellectual resource and insight — scientific, historical, mathematical, and 
philosophical — that I could. Watson and Crick showed that science involves such 
interdisciplinary activity. Many scientists working on the origin of life are particularly 
aware of this. As Freeman Dyson, a Princeton physicist who has written extensively on 
the origin of life, explains, “The origin of life is one of the few scientific problems which 
is broad enough to make use of ideas from almost all scientific disciplines.” 1 

Unfortunately, this understanding of how science works has not penetrated public 
understanding. Nor do scientists themselves always acknowledge it. Years later, as those 
of us in the intelligent design (ID) research community began to advance what we 
regarded as a new synthetic understanding of the origin and nature of life, we were 
repeatedly maligned for not “doing science.” In 2004-5, as the program I led at 
Discovery Institute suddenly found itself at the center of a frenzy of hostile media 
coverage, reporters kept repeating the same criticism, namely, “ID advocates aren’t really 
scientists, because they don’t do any experiments of their own.” Reporters would then 
demand to see our laboratory, as if doing experiments was the only kind of activity that 
scientists pursued. 

At the time, I knew there were scientists in labs around the world who supported the 
theory of intelligent design. Some of these scientists had done experiments testing 
different aspects of the theory, sometimes with our financial support. Beyond that, we 
actually had started our own laboratory, though we had chosen to keep it out of public 
view initially to protect some scientists and sensitive projects from premature attempts to 
discredit them. Later when we publicly unveiled a lab, called Biologic Institute, even our 




critics acknowledged that our scientists were addressing significant scientific questions, 
though many still attempted to stigmatize Biologic’s experimental research as 
“religiously motivated.” In any case, because of my role overseeing ID research at 
Discovery Institute, I knew firsthand that this critique of the theory of intelligent design 
was not accurate: many ID scientists do in fact work in labs and conduct experiments. 

But as Watson and Crick’s discovery showed, even if advocates of ID weren’t doing 
experiments in labs — as I personally was not — it didn’t follow that we weren’t “doing 
science.” To say otherwise betrayed a blinkered view of the scientific enterprise. Watson 
and Crick perfonned many experiments during their long careers. But the work for which 
they are best known came as the result of building models based on data they acquired 
almost exclusively from other sources — from scientific journals, other scientists, and 
other labs. 

Many of the great discoveries in science were achieved not just by experimentalists who 
produced new factual knowledge, but by theoreticians who taught us to think differently 
about what we already know. Examples of this kind of scientific work leaped to mind: 
Copernicus’s De Revolutionibus orbium coelestrium, Newton’s Principia, and the papers 
Einstein produced in his annus mirabilis, his miracle year of 1905. While working as a 
patent clerk without access to any experimental apparatus, Einstein rethought the whole 
framework of modern physics and, in the process, explained many previously 
confounding factual anomalies. 

Charles Darwin also did little experimental science. He did make several descriptive 
studies of barnacles and wonns and some experimental studies about how species spread 
through seed dispersal and other processes. Yet his masterpiece, On the Origin of Species 
by Means of Natural Selection, contains neither a single mathematical equation nor any 
report of original experimental research. Yet he formulated a great scientific theory. He 
did this by drawing together disparate lines of observational evidence and presenting an 
argument for a novel interpretation of that evidence. Of course Darwin made some of the 
observations that supported his theory himself, but even if he had not done so, this would 
have hardly detracted from his theory. Darwin’s method of investigation typified that of 
many other historical scientists who functioned more like detectives solving a mystery by 
collecting clues and developing a case than like stereotypical experimental scientists who 
test hypotheses under carefully controlled laboratory conditions. 

But I had other reasons for liking the story of Watson and Crick, and particularly the 
story of their encounter with Erwin Chargaff, in front of whom they had grievously 
embarrassed themselves in their headlong attempt to acquire infonnation about the 
chemical structure of the mystery molecule. During my years in Cambridge and even 
after, I had a number of similar experiences — although I can’t say I wasn’t warned. 

One day I attended a spellbinding lecture in my department by a visiting scholar who was 
perhaps the world’s leading authority on Immanuel Kant. During the ensuing discussion, 
I asked him if he would direct me to a good source whereby I could leam more about a 




particular aspect of Kant’s philosophy that he had just discussed. He answered me 
graciously enough, but I had a vague sense from others in the room that I may have 
created an awkward moment. As I was leaving the lecture hall, the head of my 
department, Michael Hoskin, a kindly Cambridge don of the old school, gently pulled me 
aside. “Meyer,” he intoned in his high Oxbridge accent, “I know that in the States you’ve 
learned that the only stupid question is the one you don’t ask. But it’s different here. 
Everyone here is bluffing, and if you’re to succeed, you must learn to bluff too. So never 
admit you haven’t read something. And in future, if you have a question that reveals 
ignorance, please, come ask me privately.” 

Professor Hoskin’ s advice was no doubt good, but I was temperamentally disinclined to 
follow it. As I met Cambridge scientists, philosophers, and fellow graduate students over 
high-table dinners or in common-room discussions, I often shared my interest in the DNA 
enigma. When I met someone with relevant expertise, I would probe for insights. Often, I 
learned things as a result, but sometimes I felt the sting of embarrassment at having asked 
one too many questions. 

Foolishly perhaps, I also sometimes let it slip that I was interested in the question of 
design and, specifically, in the question of whether a rigorous scientific argument for 
design could be reformulated based upon the discovery of the information-bearing 
properties of DNA. It did not take me long, naive American though I was, to understand 
that this was not always a welcome turn in the conversation. Sometimes fellow graduate 
students would warn me about not venturing into such a dangerous topic, though more 
often I received blank and impassive stares. Cambridge was, after all, Darwin’s alma 
mater, and the design hypothesis had not been seriously debated in biology for almost a 
hundred and fifty years. 

Oddly, I did find willingness to discuss these topics among some of the more prominent 
scientists whom I had the good fortune to meet. The year before coming to Cambridge, I 
had the opportunity to interview the Nobel Prize-winning physicist Eugene Wigner at a 
conference at Yale. I asked him directly about the origin-of-life problem, and he 
explained why he thought the odds were overwhelmingly against any process of 
undirected chemical evolution producing life. Wigner further expressed openness to, even 
sympathy for, the idea that the infonnation present in DNA pointed to an intelligent 
source. 

Upon arriving in Cambridge I had a few similar experiences, but again only with very 
prominent scientists, those who, perhaps, felt secure enough in their reputations to 
entertain heterodox ideas openly. Once, before attending a high-table dinner at Churchill 
College as the guest of Professor Hoskin, a charming Hungarian man broke with 
Cambridge etiquette and sat down crossed-legged in front of me on the carpet in the 
senior common room. He introduced himself merely as Herman and began asking me 
questions about my research interests. As I explained them, he expressed his fascination 
with my ideas about DNA and shared some of his own questions about the origin-of-life 
problem. Later, as we resumed our conversation after a dinner in which anti-American 




invective was flowing as freely as the port and madeira, Professor Hoskin circled around 
to make sure that we had made proper introductions. 

“Oh, Bondi,” he said, “I’m delighted to see that you’ve met young Meyer.” I gulped as I 
suddenly realized who it was that I had been regaling with my various musings about the 
origin of life. “Herman” was Sir Professor Herman Bondi, the mathematician and 
cosmologist who had, with Sir Fred Hoyle, formulated the steady-state theory of the 
universe and who was also famous for his work on general relativity. Beyond that, I knew 
that Bondi was a well-known secular humanist. He had every philosophical inclination to 
dismiss my ideas out of hand. Yet there he sat, cross-legged, in front of a graduate 
student, openly discussing a radical proposal for solving the problem of the origin of life. 
I’ll never forget him. 

Later Bondi’s collaborator, Fred Hoyle, also came to Cambridge to discuss his views on 
the origin of life. This was a lecture I couldn’t miss. Hoyle was famous for his correct 
prediction of the resonance levels of the carbon atom and for his pioneering work on the 
fine-tuning problem in physics and cosmology. But he hadn’t come to Cambridge to 
discuss any of this. He was here to explain why he had come to reject chemical 
evolutionary theory. Afterward I asked him directly about whether he thought the 
information stored in DNA might point to an intelligent source. His eyes brightened, and 
he motioned me to continue walking with him after his lecture. “That would certainly 
make life a lot easier to explain,” he said. 



Modern Cambridge, Ancient Cambridge 

Apart from a few such encounters, contemporary Cambridge did not share my interest in 
design. Nevertheless, it didn’t take me long to find that ancient Cambridge did. 
Cambridge was world-famous as a city where the history of science was not just studied, 
but made. I soon discovered that those who had established the scientific enterprise and, 
for several hundred years, made its history did not share the disdain for the design 
hypothesis that I had encountered. The founders of the scientific revolution (ca. 
1300-1700) were often deeply religious men who expressed a profound appreciation for 
the design of life and the universe. Moreover, for these scientists, the concept of design 
was not just a pious sentiment. For them it was an indispensable assumption upon which 
the whole of the scientific enterprise rested. 

As many historians of science have noted, the founders of modem science needed to 
assume that if they studied nature carefully, it would reveal its secrets. Their confidence 
in this assumption was grounded in the Greek and Judeo-Christian idea that the universe 
is an orderly system — a cosmos, not a chaos. As the British philosopher Alfred North 
Whitehead explained, “There can be no living science unless there is a widespread 
instinctive conviction in the existence of an Order of Things. And, in particular, of an 
Order of Nature .” Whitehead argued that confidence in this proposition was especially 
inspired by the “medieval insistence upon the rationality of God.” 




Other historians of science have amplified Whitehead’s observation. They have insisted 
that modern science was specifically inspired by the conviction that the universe is the 
product of a rational mind who designed the universe to be understood and the human 
mind to understand it. As sociologist of science Steve Fuller notes, Western science is 
grounded in “the belief that the natural order is the product of a single intelligence from 
which our own intelligence descends .” 5 This foundational assumption gave rise to the 
idea that nature was “intelligible,” that it had been designed in accord with discernable 
laws that could be understood by those who subjected nature to careful scrutiny. Or as the 
astronomer Johannes Kepler said, scientists have the job of “thinking God’s thoughts 
after him .” 6 

Though Greek, Jewish, and Christian philosophers agreed about the rationality of nature, 
they did not necessarily agree about how to discover it. Many historians of science have 
noted that Greek ideas about nature tended to induce a sterile annchair philosophizing 
unconstrained by actual observations. For science to advance, it needed to develop a 
more empirical approach. This began to occur during the scientific revolution after 
medieval philosophers and the early modem scientists made a decisive break with one 
aspect of Greek thinking. 

Although the Greek philosophers thought that nature reflected an underlying order, they 
nevertheless believed that this order issued not from a designing mind, but from an 
underlying and self-evident logical principle. For this reason, many assumed that they 
could deduce how nature ought to behave from first principles without actually observing 
nature. In astronomy, for example, the Greeks (Aristotle and Ptolemy) assumed that 
planets must move in circular orbits. Why? Because according to the Greek cosmology, 
the planets moved in the “quintessential” realm of the crystalline spheres, a heavenly 
realm in which only perfection was possible. Since, they deduced, the most perfect form 
of motion was circular, the planets must move in circular orbits. What could be more 
logical? 

The idea of design helped liberate Western science from such fact-free reasoning. Like 
the Greek philosophers, the early modem scientists thought that nature exhibited an 
underlying order. Nevertheless, they thought this natural order had been impressed on 
nature by a designing mind, in particular, the mind of the Judeo-Christian God. For this 
reason they thought that the order in nature was the product not of logical necessity, but 
of rational deliberation and choice. Because nature had been designed by the same mind 
that had designed the human mind, the early scientists assumed that nature was 
intelligible, that it could be understood by the human intellect. But because the order in 
nature was also contingent on the will of its designer, they realized they had to observe 
nature in order to get it to reveal its secrets. 

Just as there are many ways to paint a picture or design a clock or organize the books in a 
library, there were many ways to design and organize a universe. Because it had been 
chosen by a rational mind, the order in nature could have been otherwise. Thus, the 
natural philosophers could not merely deduce the order of nature from logical first 




principles; they needed to observe nature carefully and systematically. As the chemist 
Robert Boyle explained, the job of the natural philosopher (the scientist) was not to ask 
what God must have done, but (as far as possible) to inquire into what God actually did. 
Boyle argued that God’s absolute freedom as designer and creator requires of us an 
empirical and observational, not a deductive, approach to the natural order: 
look — observe — to find out. 

Thus, the assumption that a rational mind had designed the universe gave rise to two 
ideas — intelligibility and contingency — which, in turn, provided a powerful impetus to 
study nature and to feel confident that such study would yield understanding. As the 
Oxford physicist and historian of science Peter Hodgson observes: “According to 
Judeo-Christian beliefs the world is the free creation of God from nothing. The structure 
of the world cannot therefore be deduced from first principles; we have to look at it, to 
make observations and experiments to find out how God made it. This reinforces the 
Aristotelian principle that all knowledge comes through the senses, but requires that it be 
situated within a wider set of beliefs concerning the nature of the world that is implicit in 
the doctrine of creation.” 8 Hodgson notes that early scientists assumed that the world was 
both rational — because it was created by a Mind — and contingent — because that Mind 
had acted freely. These assumptions led to “a fresh style of scientific thinking,” one that 
“was made possible by the Judeo-Christian vision of the world.” 9 

Everywhere I went in the city of Cambridge, I encountered evidence of this long 
dominant viewpoint. Each day as I walked to my department on Free School Lane, I 
passed by the entrance to the old Cavendish Laboratory in which thirty-odd years before 
Francis Crick and James Watson realized that their model of DNA was so beautiful it had 
to be right. 10 On the archway across the great wooden door of the Cavendish was a Latin 
inscription that reads, Magna opera Domini. Exquista in omnes voluntates ejus. The 
inscription had been placed there at the insistence of the physicist James Clark Maxwell, 
the first Cavendish professor in 1871. The inscription quotes a Psalm that reads, “Great 
are the works of the Lord, sought out by all who take pleasure therein.” The inscription 
summarized Maxwell’s inspiration for scientific study: the thought that works of nature 
reflect the work of a designing mind. In this belief he had been joined by many of the 
leading scientists of Western civilization for over four hundred years — Copernicus, 
Kepler, Ray, Linnaeus, Curvier, Agassiz, Boyle, Newton, Kelvin, Faraday, 
Rutherford — on and on the list could go. 

As I studied the history of science, I soon discovered, however, that many of these 
scientists did not just assume or assert by faith that the universe had been designed; they 
also argued for this hypothesis based on discoveries in their disciplines. Johannes Kepler 
perceived intelligent design in the mathematical precision of planetary motion and the 
three laws he discovered that describe that motion. Other scientists perceived design in 
many of the structures or features of the natural world upon which the laws of nature 
operated. Louis Agassiz, the leading American naturalist of the nineteenth century, for 
whom the Agassiz Chair is named at Harvard, believed that the patterns of appearance in 
the fossil record pointed unmistakably to design. 1 1 Carl Linnaeus argued for design based 




upon the ease with which plants and animals fell into an orderly groups-within-groups 
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system of classification. Robert Boyle insisted that the intricate clocklike regularity of 
many physical mechanisms suggested the activity of “a most intelligent and designing 
agent.” 13 

Newton, in particular, was noteworthy in this regard. As I discussed in Chapter 1, he 
made specific design arguments based upon discoveries in physics, biology, and 
astronomy. He argued for the intelligent design of the eye in his classic work, Opticks. He 
also argued for the intelligent design of the planetary system in his masterpiece, The 
Mathematical Principles of Natural Philosophy (often cited in brief by part of its Latin 
title, the Principia). 4 Writing in the General Scholium (introduction) to the Principia, 
Newton suggested that the stability of the planetary system depended not only upon the 
regular action of universal gravitation, but also upon the precise initial positioning of the 
planets and comets in relation to the sun. As he explained: “Though these bodies may, 
indeed, persevere in their orbits by the mere laws of gravity, yet they could by no means 
have, at first, derived the regular position of the orbits themselves from those laws.... 
[Thus] this most beautiful system of the sun, planets, and comets could only proceed 
from the counsel and dominion of an intelligent and powerful being.” 15 

So central was the idea of design to Newton’s scientific perspective that I was repeatedly 
cautioned not to miss it as I researched Newton’s ideas about gravity in the Principia. In 
one tutorial, one of my supervisors, a staunch atheist, reminded me not to read modern 
secular biases into my interpretation of Newton’s work. “If you miss Newton’s theism,” 
he exhorted, “you’ve missed everything.” 

What I learned about Newton would come in handy years later. In 1999, I was asked to 
testily at a hearing of the U.S. Commission on Civil Rights about the alleged viewpoint 
discrimination in the teaching of biological origins. My opposing expert witness at this 
hearing was Eugenie Scott, an anthropologist who heads the National Center for Science 
Education. Scott had distinguished herself as an ardent critic of the scientific legitimacy 
of intelligent design and has consistently argued that the very idea of intelligent design 
violates the rules of science. According to Scott, these rules preclude invoking any 
nonnaturalistic or nonmaterialistic causes (such as the action of a designing intelligence) 
to explain features of the natural world. 

At the hearing, one of the commissioners asked me an interesting question. He wondered, 
given Newton’s views on intelligent design in physics and Dr. Scott’s widely accepted 
definition of science, whether Newtonian physics would qualify as “scientific” in our 
present educational climate. Eugenie Scott responded by insisting that the commissioner 
had misunderstood the context in which Newton discussed intelligent design. She 
explained that, for Newton, intelligent design was merely a theological belief — one that 
he took pains to keep separate from his science so as to avoid subjecting it to empirical 
test. Here’s what she said according to the transcript of the hearing: “Newton made a very 
clean distinction — as a very religious scientist — he made a very clear distinction about 
how science should work. Newton’s view was that we should understand the natural 




world solely by using natural processes. And he said this for religious reasons because he 
didn’t want God’s existence or God’s transcendence, shall we say, to be tested by the 
base methods of science.” Scott then proceeded to reassert that invoking intelligent 
design would violate the central rule of science, namely, as she put it, that “we explain 
the natural world restricting ourselves only to natural cause .” 16 

As it happened, I had just finished an essay on the history of scientific design arguments 
and had quoted Newton’s argument in the Principia — an indisputably scientific 
treatise — at length. The essay was at that moment sitting in my briefcase. As the 
transcript of the hearing makes clear, I immediately challenged Scott’s portrayal of 
Newton. “The historical point on Newton, I’m afraid, is just simply incorrect. If one 
opens the General Scholium, the introduction to the Principia, arguably the greatest book 
of science ever written, one finds an exquisite design argument by Newton in which he 
makes clear that the arrangements of the planets can only be explained not, he says, by 
natural law, but only by the contrivance of a most wise artificer. He’s very explicit about 
this. This is in the introduction to his magnum opus.” Several of the commissioners 
appeared intrigued, so I pressed on, noting that Newton was not alone in his formulation 
of such design arguments. I continued, “You find these kinds of design arguments all 
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throughout the scientific revolution, from Boyle, from Kepler, from others.” 



The Abolition of Design 

Based on my research (and from noticing many of the landmarks at Cambridge), I had 
learned that the historical record on this point was irrefutable. I began to wonder: How 
could the act of invoking something so foundational to the history of science as the idea 
of design now completely violate the rules of science itself, as I had repeatedly heard 
many scientists assert? If belief in intelligent design first inspired modem scientific 
investigation, how could mere openness to the design hypothesis now act as a “science 
stopper” and threaten to put an end to productive scientific research altogether, as some 
scientists feared? 

Clearly, the idea of intelligent design had played a formative role in the foundation of 
modern science. Many great scientists had proposed specific design hypotheses. This 
seemed to suggest to me that intelligent design could function as a possible scientific 
hypothesis. But many contemporary scientists rejected this idea out of hand. Why? 



The Nature of Historical Scientific Reasoning 

I remembered that my Dallas mentor, Charles Thaxton, thought that many scientists 
today rejected design out of hand because they failed to recognize that there were 
different types of scientific inquiry, specifically, that there was a distinctive kind of 
scientific inquiry concerned with investigating and explaining the past. His distinction 
between origins and operations science suggested a reason to consider an “intelligent 
cause” as a possible scientific explanation for the origin of life. Thaxton thought that 




since the origin of life was a unique historical event, and since the singular actions of 
intelligent agents often produce unique events, the act of a designing agent might provide 
an appropriate kind of scientific explanation for this event. This made sense when I first 
encountered Thaxton’s distinction. It seemed to me that invoking an “intelligent cause” 
as the explanation for a past event might be a perfectly reasonable thing to do for the 
simple reason that the activity of an intelligent agent might have actually caused the event 
in question. 

It now occurred to me that the great scientists who had proposed design hypotheses 
during the scientific revolution had typically done so to explain the origin of some system 
or feature of the world — the eye, solar system, the universe, the “correlation of parts” in 
biological systems. 18 So perhaps Thaxton was right. Perhaps the design hypothesis could 
be formulated as a rigorous scientific explanation for events such as the origin of life. But 
to decide that question I would need to know how scientists investigated the past and how 
they formulated and justified their theories. And so the focus of my investigation shifted 
from an examination of the features of life that needed to be explained (see Chapters 3 
through 5) to a study of how historical scientists go about explaining the origin of 
particular features and events in the remote past (see Chapter 7). 
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Of Clues to Causes 



When I left Dallas in 1986 I was seized by questions that had emerged in my discussions 
with Charles Thaxton. Could the inference to design from DNA be formulated as a 
rigorous scientific argument? Is there a distinctive scientific method for studying the 
remote past? When I left for Cambridge, I was intrigued by Thaxton’ s ideas, but not fully 
convinced. For one thing, I didn’t find his account of the distinction between the two 
types of science fully satisfying. The distinction seemed plausible, but needed further 
clarification and justification. 

Even so, I soon discovered that other scientists had made comparable distinctions, though 
usually using different terminology. Stephen Jay Gould, the Harvard paleontologist and 
historian of science, insisted that the “historical sciences” such as geology, evolutionary 
biology, and paleontology used different methods than did “experimental sciences” such 
as chemistry and physics. Interestingly, he also argued that understanding how historical 
sciences differed from experimental sciences helped to legitimate evolutionary theory in 
the face of challenges to its scientific rigor by those who questioned its testability. Gould 
argued that historical scientific theories were testable, but not necessarily by experiments 
under controlled laboratory conditions. Instead, he emphasized that historical scientists 
tested their theories by evaluating their explanatory power. 1 

Could the same thing be true of the design hypothesis? Perhaps a theory of intelligent 
design could be formulated as a historical scientific theory about what had happened in 
the past. If so, perhaps such a theory could be tested by assessing its explanatory power 
rather than its ability to generate predictions in a controlled laboratory setting. 

In my second year at Cambridge, after I had acquired a better understanding of what 
origin-of-life researchers needed to explain, I began to investigate the questions that first 
emerged in my discussions with Thaxton. Is there a distinctive mode of historical 
scientific investigation? If so, how do scientists reason and make inferences about the 
past? Are these inferences testable? If so, how? And in the back of my mind another 
question continued to linger: If there is a historical way of scientific reasoning, could 
such reasoning be used to make a rigorous scientific case for intelligent design? 

As I began to study how scientists investigate the past, I examined the works of the 
nineteenth-century scientists and philosophers of science who first developed and refined 
these methods. This eventually led me to the masters: to Charles Darwin and his mentor, 
the geologist Charles Lyell, and to a few lesser known luminaries who, along with 
Darwin and Lyell, pioneered a method of scientific investigation designed to illuminate 
the history of life, the earth, and the cosmos. 



The Science of Past Causes 



Visit the Royal Tyrrell Museum in Alberta, Canada, and you will find a beautiful 

reconstruction of the Cambrian seafloor with its stunning assemblage of ancient 

organisms. Or read the fourth chapter of Cambridge paleobiologist Simon Conway 

Morris’s book on the Burgess Shale and you will be taken on a vivid guided tour of an 
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ancient marine environment teaming with exotic life-forms. How do scientists come to 
know about such ancient environments? What methods of reasoning do they use to 
investigate what happened on the earth so long ago? 

I soon discovered that I wasn’t the first person to ask such questions. During the 1830s 
and 1840s, William Whewell, a distinguished scientist and philosopher and the master of 
Trinity College, Cambridge, published two books about the nature of science that 
addressed this issue. 4 The publication of Whewell’s work coincided with a surge of 
interest in natural history in Victorian England. By the 1830s, Charles Lyell had 
published his seminal Principles of Geology, the first scientific descriptions of dinosaur 
fossils had been made, and new evolutionary ideas about the history of life were 
circulating in elite scientific societies. 

In his volumes History of the Inductive Sciences (1837) and The Philosophy of the 
Inductive Sciences (1840), Whewell distinguished sciences such as physics and chemistry 
from what he called palaetiology — literally, the “study of past causes.” And he argued 
that “palaetiological,” or historical, sciences could be distinguished from nonhistorical 
sciences in three ways. 5 First, the palaetiological, or historical, sciences have a distinctive 
objective: to determine ancient conditions or past causes, as opposed to establishing 
universal laws by which nature generally operates. 6 

Second, such sciences explain present events (“manifest effects”) by reference to past 
(causal) events, rather than by reference to general laws (though Whewell acknowledged 
that laws often play a supporting role in such explanations). Historical scientists cite the 
occurrence of an event or series of events in the past as the explanation for some 
observable phenomenon in the present. 8 For example, a geologist might explain the 
origin of the Alps as the result of a series of geological events involving the collision of 
specific tectonic plates, the overthrusting of sedimentary layers, and then the subsequent 
folding and faulting of those layers. As science historian Jonathan Hodge explains, 
Whewell realized that historical sciences do not study “forces that are permanent causes 
of motion, such as gravitational attraction,” but “causes that have worked their effects in 
temporal succession.” 9 

Third, in their attempt to reconstruct “ancient conditions,” Whewell argued that historical 
scientists also utilized a distinctive mode of reasoning. 10 Using knowledge of 
cause-and-effect relationships, historical scientists “calculate backwards” and “infer” past 
conditions and causes from “manifest effects.” 11 As Gould later put it, historical 
scientists proceed by “inferring history from its results.” For example, in order to 
reconstruct the Cambrian environment, paleontologists such as Gould and Conway 
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Morris made inferences about the past based on present-day fossils and other clues. 
They inferred a past environment and set of conditions as the cause of the evidence they 
found. Like other historical scientists, they reasoned from clues back to causes . 14 



Abductive Reasoning 

This type of reasoning is called abductive reasoning or abduction. It was first described 
by the American philosopher and logician Charles Sanders Peirce. He noted that, unlike 
inductive reasoning, in which a universal law or principle is established from repeated 
observations of the same phenomena, and unlike deductive reasoning, in which a 
particular fact is deduced by applying a general law to another particular fact or case, 
abductive reasoning infers unseen facts, events, or causes in the past from clues or facts 
in the present . 15 

As Peirce himself showed, there is a problem with abductive reasoning . 16 Consider the 
following syllogism: 



If it rains, the streets will get wet. 
The streets are wet. 

Therefore, it rained. 



In this syllogism, a past condition (it was raining) is inferred from a present clue (the 
streets are wet). Nevertheless, this syllogism commits a logical fallacy known as 
affirming the consequent. Given that the street is wet (and without additional evidence to 
decide the matter), one can only conclude that perhaps it rained. Why? Because there are 
other possible ways the street may have gotten wet. A street-cleaning machine, an 
uncapped fire hydrant, or rain might have caused the street to get wet. It can be difficult 
to infer the past from the present because there are often many possible causes of a given 
effect. When this is the case, abductive inferences yield plausible, but not certain, 
conclusions. 

For Peirce, this raised an important question: How is it that, despite the logical problem 
of affirming the consequent, we nevertheless frequently make conclusive inferences 
about the past? He noted, for example, that no one doubts the existence of Napoleon. Yet 
we use abductive reasoning to infer Napoleon’s existence. That is, we infer his past 
existence not by traveling back in time and observing him directly, but by inferring his 
existence from our study of present effects, namely, artifacts and records. But despite our 
dependence on abductive reasoning to make this inference, no sane and educated person 
would doubt that Napoleon Bonaparte actually lived. How could this be if the problem of 
affirming the consequent bedevils our attempts to reason abductively? Peirce’s answer 
was revealing: “Though we have not seen the man [Napoleon], yet we cannot explain 
what we have seen without the hypothesis of his existence .” 17 Peirce suggested that a 
particular abductive hypothesis can be firmly established if it can be shown that it 
represents the best or only explanation of the “manifest effects” in question. 




As Peirce noted, the problem with abductive reasoning is that there is often more than 
one cause that can explain the same effect. To address this problem in geology, the 
late -nineteenth-century geologist Thomas Chamberlain delineated a method of reasoning 
he called the “method of multiple working hypotheses.” 18 Geologists and other historical 
scientists use this method when there is more than one possible cause or hypothesis to 
explain the same evidence. In such cases, historical scientists carefully weigh the relevant 
evidence and what they know about various possible causes to determine which best 
explains it. Contemporary philosophers of science call this the method of “inference to 
the best explanation.” That is, when trying to explain the origin of an event or structure 
from the past, historical scientists compare various hypotheses to see which would, if 
true, best explain it. They then provisionally affirm the hypothesis that best explains the 
data as the most likely to be true. 



Peter Lipton and Inference to the Best Explanation 

In the spring of 1990, as I was pressing to finish my Ph.D. dissertation, I had the good 
fortune to meet an American philosopher of science who helped me understand abductive 
reasoning. Peter Lipton had just been hired away from Williams College in 
Massachusetts by the Cambridge Department of History and Philosophy of Science. After 
assuming the position of lecturer (the British equivalent of assistant professor), he 
eventually rose to become professor and chair of the department before his premature 
death in 2007 at the age of fifty-three. While Lipton was visiting Cambridge in the spring 
of 1990, one of the Cambridge faculty members, seeing that we shared some common 
interests, introduced us. Lipton had just finished writing a major treatise called Inference 
to the Best Explanation, which later would become something of a minor classic in 
contemporary philosophy of science. Given what I had learned about how historical 
scientists needed to weigh competing hypotheses, I was immediately interested in 
Lipton’s ideas. He kindly sent me a pre-publication copy of his manuscript. 

Lipton had made a systematic study and defense of a way of reasoning that he called 
“inference to the best explanation” (IBE). He argued that this way of reasoning was used 
commonly in science and ordinary life. For example, he noted, that “faced with tracks in 
the snow of a peculiar shape, I infer that a person with snowshoes has recently passed this 
way. There are other possibilities, but I make this inference because it provides the best 
explanation of what I see.” 19 From the examples he used to illustrate “inference to the 
best explanation,” it was clear that this method of reasoning was especially useful to 
scientists who were trying to provide causal explanations of events or circumstantial 
evidence. 

I was impressed by Fipton’s work. He wrote clearly, using practical scientific case 
studies to illustrate his ideas about this scientific method. His work employed none of the 
jargon or penchant for abstraction that sometimes gave philosophers of science a bad 
name with working scientists. Instead, he obviously respected the practice of science and 
wanted to give an accurate account of how scientists actually fonnulate and justify their 
hypotheses and inferences. 





Figure 7.1. Peter Lipton. Printed by permission from Howard Guest. 



Interestingly, in his Oxford doctoral dissertation, Lipton initially had argued against 
inference to the best explanation as an accurate understanding of scientific reasoning. He 
did this, he said, because he didn’t have time as a doctoral student to mount a sufficiently 
comprehensive case for it. Instead, he decided to make the best case against it, so that he 
would fully understand the objections to it as a way of preparing himself to write a book 
defending it. His story revealed a person capable of weighing the merits of competing 
arguments before deciding a question. Lipton practiced what he preached. 

According to Lipton, “beginning with the evidence available to us” we generally “infer 
what would, if true, best explain that evidence.” 20 This echoed what Peirce had said 
about how an abductive inference can be strengthened by showing that it alone can 
explain the evidence. But Lipton made the connection between inference and explanation 
more explicit and showed how considerations of explanatory power often influenced 
assessments of competing inferences or hypotheses. Peirce and Gould had convinced me 
that historical scientists used a distinctive kind of inference — in which past causes are 
inferred from present effects. Whewell and other philosophers of science showed that 
historical scientists formulated distinctive types of explanations — ones in which past 
events or causes are invoked to explain particular facts. Lipton now showed how the two 
intellectual activities of inferring and explaining are connected. 21 




He showed that scientists often evaluate competing inferences or hypotheses by 
comparing their explanatory power. That is, assessments of an inference’s explanatory 
power determine how much stock we place in it. In making this argument, Lipton 
challenged a popular conception of science, namely, that scientists test their theories only 
by evaluating the accuracy of the predictions that their theories make about future 
events. Lipton demonstrated that the ability to explain known facts often mattered as 
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much or more than predictive success in the evaluation of an inference or hypothesis. 

But there was a problem with this kind of reasoning, one that Lipton and many 
philosophers of science had noted. Many regarded “inference to the best explanation” as 
little more than a slogan, because no one could say exactly what made an explanation 
best. “Sure,” many argued, “we often infer hypotheses that, if true, best explain the 

94 

evidence, but what does it mean to explain something and to explain it best?” 

As a first step toward answering the question, “What makes an explanation best?” Lipton 
gave a general account of what a good explanation should look like. He noted that 
scientists sometimes explain general classes of phenomena (such as condensation) and 
sometimes a particular fact or event (such as the extinction of the dinosaurs). According 
to Lipton, good explanations — whether of particular one-time events or general 
phenomena — are typically causal. 

In the case of general phenomena such as condensation, good explanations typically cite 
causal mechanisms. For example, a physicist would explain condensation by citing a 
general mechanism of heat exchange. (When warm water vapor contacts a cold solid, the 
colder solid draws heat out of the gas, causing the water to change from a gaseous to a 
liquid state.) 

In the case of particular events, Lipton showed that a good explanation cites a prior 
cause, typically an event that is part of the “causal history” of the event in question. 
Further, Lipton showed that to identify the cause of an event, scientists must identify 
something within the causal history of the event that accounts for a crucial 
difference — the difference between what did occur and what we might have otherwise 
expected. Good explanations answer questions of the form, “Why this rather than that?” 
A good (or best) explanation cites an event that makes a “causal difference” in 
outcome . 25 

Here’s an example of what Lipton had in mind. Imagine that a paleontologist wants to 
explain why the dinosaurs went extinct in the Cretaceous period. He realizes that many 
events led up to the extinction of the dinosaurs, from the big bang to the formation of 
continents to the origin of the dinosaurs themselves. All these events are part of the 
“causal history” of dinosaur extinction. But of these events, the one that best explains the 
extinction is the event — the cause — that accounts for the difference between what 
happened and what we might have otherwise expected to happen — in this case, the 
difference between the extinction of the dinosaurs and the dinosaurs continuing to roam 
the earth. The big bang is part of the causal history of extinction of the dinosaurs, but it 




doesn’t explain the extinction. Why? It does not account for the difference in outcome. 
Whether the dinosaurs had continued to live or became extinct, the big bang would still 
have been part of either causal history. 

As it happens, paleontologists have proposed a theory that cites a causal difference. As an 
explanation for a variety of evidence, they have inferred that a massive meteorite hit the 
earth at the end of the Cretaceous period, causing an environmental catastrophe. They 
postulate that the meteorite impact generated dust and debris that blocked sunlight and 
cooled the earth, eventually killing the cold-blooded dinosaurs by destroying their food 
supply. 

This explanation illustrates Lipton’s conception of a sound causal explanation. It explains 
one event (the extinction of the dinosaurs) by citing another prior event (the meteorite 
impact and subsequent environmental catastrophe) as the cause. It also accounts for the 
difference between what happened (the extinction of the dinosaurs) and what otherwise 
might have been expected to happen (the dinosaurs continuing to thrive). Indeed, had the 
meteorite not hit the earth, the dinosaurs would, presumably, have continued to live 
beyond the Cretaceous period. Thus, the meteorite impact could have made a causal 
difference. The impact hypothesis also could explain many other evidences — such as the 
iridium levels in rocks from the period and the presence of an impact crater in the 
Yucatan Peninsula — and so it has broad explanatory power. None of these facts alone 
prove the hypothesis is correct, but they provide support for it and put it in contention as 
a possible best explanation. 

In Inference to the Best Explanation, Lipton went on to develop an even more 
comprehensive account of causal explanation in order to enhance his characterization of 
how the method of inference to the best explanation works. In the process, he also 
provided a general, if partial, answer to the question, “What makes an explanation best?” 
Because of his more general focus on how IBE works, however, Lipton did not address 
the specific questions about the nature of historical scientific reasoning that most 
interested me. Nevertheless, his general description of how scientists evaluate inferences 
by assessing their explanatory power aptly characterized how historical scientists 
evaluate their theories. Indeed, many of his examples involved scientists trying to explain 
particular events by reference to past causes. 

Moreover, his method directly addressed the main problem that often arose in historical 
scientific investigations. Lipton noted that sometimes more than one possible explanation 
might satisfy his general criteria of “best” by citing a past cause that could conceivably 
make a causal difference. For example, paleontologists have proposed other possible 
causes for the extinction of the dinosaurs — such as a sharp increase in volcanic 
activity — that could explain why the dinosaurs went extinct rather than continuing to 
exist. Lipton noted that in such cases, where more than one appropriately causal 
explanation is available, scientists use a comparative method of evaluation and a process 

o zr 

of elimination to evaluate competing possible causal hypotheses. 




At the same time, Lipton did not attempt to establish criteria for determining which past 
cause among a group of possible causes constituted the best explanation in such cases. 
Fortunately, however, I discovered that the founders of several historical sciences had 
developed their own practical criteria for detennining what made an explanation of a past 
event “best.” These criteria supplemented Lipton’s work in a way that made them 
practically relevant to scientists investigating the past and helped to answer the question, 
“What makes an explanation of a past event ‘best’?” 



Causes Now in Operation 

Historical scientists have developed two key criteria for deciding which cause, among a 
group of competing possible causes, provides the best explanation for some relevant body 
of evidence. The most important of these criteria is called “causal adequacy.” This 
criterion requires that historical scientists, as a condition of a successful explanation, 
identify causes that are known to have the power to produce the kind of effect, feature, or 
event in need of explanation. In making these determinations, historical scientists 
evaluate hypotheses against their present knowledge of cause and effect. Causes that are 
known to produce the effect in question are judged to be better candidates than those that 
are not. 

For instance, a volcanic eruption provides a better explanation for an ash layer in the 
earth than an earthquake, because eruptions have been observed to produce ash layers, 
whereas earthquakes have not. A receding lakeshore offers a better explanation of ripple 
marks in sedimentary rock than does a volcanic eruption, because evaporating lakes have 
been observed to produce ripple marks, whereas volcanic eruptions have not. 

One of the first scientists to develop this principle was the geologist Charles Lyell, who 
exerted tremendous influence on the development of nineteenth-century historical 
science — and on Charles Darwin, in particular. Darwin read Lyell’s magnum opus, The 
Principles of Geology, on the voyage of the Beagle and employed its principles of 
reasoning in On the Origin of Species. The subtitle of Lyell’s Principles summarized the 
geologist’s central methodological principle: Being an Attempt to Explain the Former 
Changes of the Earth ’s Surface, by Reference to Causes Now in Operation. Lyell argued 
that when historical scientists are seeking to explain events in the past, they should not 
invoke unknown or exotic causes, the effects of which we do not know; instead, they 
should cite causes that are known from our uniform experience to have the power to 
produce the effect in question." Historical scientists should cite “ causes now in 
operation, ” or presently acting causes. This was the idea behind his uniformitarian 
principle and the dictum, “The present is the key to the past.” According to Lyell, our 
present experience of cause and effect should guide our reasoning about the causes of 
past events. Or as Whewell explained, “Our knowledge respecting [past] causes. . .must 
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be arrived at by ascertaining what the causes of change in such matters can do.” 

Darwin himself adopted this methodological principle. His tenn for a presently acting 
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cause was a vera causa, that is, a true, known, or actual cause. Darwin thought that 




when explaining past events, scientists should seek to identify established causes — causes 
known to produce the effect in question. Darwin appealed to this principle to argue that 
presently observed microevolutionary processes of change could be used to explain the 
origin of new fonns of life in the past. Since the observed process of natural selection 
can produce a small amount of change in a short time, Darwin argued that it was capable 
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of producing a large amount of change over a long period of time. In that sense, natural 
selection was “causally adequate.” 

In the late 1950s and early 1960s, a leading philosopher of science from the University of 
California at Berkeley named Michael Scriven made a detailed study of how historical 
scientists reconstruct the past. He argued that historical scientists use a distinctive method 
of reasoning that he called “retrospective causal analysis.” In his description of this 
method of reasoning, he echoed the importance of the causal-adequacy criterion affirmed 
by Darwin and Lyell. According to Scriven, in order to establish the cause of a past 
event, historical scientists must first show that any candidate cause “has on other 
occasions clearly demonstrated its capacity to produce an effect of the sort here under 
study.” As Scriven put it, historical scientists must show that “ independent evidence 
supports the claim that it [the cause] can produce this effect.” 34 



And Then There Was One 

Both philosophers of science and leading historical scientists have emphasized causal 
adequacy as the main criterion by which competing hypotheses are adjudicated. But 
philosophers of science also have noted that assessments of explanatory power lead to 
sound inferences only when it can be shown that there is one known cause for the effect 
or evidence in question. Michael Scriven, for example, points out that historical scientists 
can make inferences about the past with confidence when they encounter evidence or a 
condition for which there is only one known cause. When historical scientists know 
that there is only one known cause of a given effect, they can infer the cause from the 
occurrence of its effect. When scientists can infer a uniquely plausible cause, they can 
avoid the fallacy of affirming the consequent — the error of ignoring other possible causes 
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with the power to produce the same effect. 

In his book Reconstructing the Past, Elliott Sober shows that the tasks of inferring the 
past may be easy or difficult (or even impossible) depending upon whether there are 
many causes that produce the effect in question or just one. He suggests that the severity 
of the problem confronting historical scientists will depend on whether the processes 
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linking the present and past are “information preserving” or “information destroying.” 
To Sober, an information-preserving process is one that maps a present state to a single 
past state, whereas an infonnation-destroying process is one that maps a present state to a 
multiplicity of possible past states. He illustrates these two general cases with the 
following idealized diagram: Sober uses a simple mechanical system to illustrate what 
makes reconstructing the past either easy or difficult. He notes that if someone places a 
metal ball on the rim of a concave metal bowl and releases it, it may traverse a complex 
path before coming to rest at the bottom of the bowl. Observing the ball in its final resting 




position (at equilibrium) will reveal little about the ball’s initial position, since the ball 
could have been released from many places on the rim and still ended up at the bottom of 
the bowl. Imagine, however, that a ball is rolled down an indented track with high 
boundaries on either side and a large hole at the bottom, so that the ball will always come 
to rest in the hole just below the place where it is released. In this case, the final resting 
place of the ball will tell far more about the initial position of the ball. Indeed, the final 
state will correspond to just a single initial state, making it easy to reconstruct the ball’s 
initial position from the observation of its final position. J 8 




Figure 7.2. Schematic of the logical problem of retrodiction. Whether reconstructing the 
past is easy or difficult depends upon whether there is a single cause or condition that 
gives rise to a given present state or whether there are many possible past causes or 
conditions that give rise to a given present state. The diagram on the left portrays an 
information-destroying situation, in which many past causes (or conditions) correspond 
to a given present state. The diagram on the right portrays an information-preserving 
situation, in which only one past cause (or condition) corresponds to a present state. 
Adapted from Sober, Reconstructing the Past, 4. 



Sober’s illustration suggests that if scientists can discover an effect for which there is 
only one plausible cause, they can infer the presence or action of that cause in the past 

'JQ 

with great confidence. But if they can’t, establishing a past cause is much more 
difficult. Two logical syllogisms illustrate the nub of the issue. First form: 



If X, then Y. 
Y occurred. 
Therefore X. 



Arguments of this fonn commit the fallacy of affirming the consequent. Nevertheless, 
such arguments can be restated in a logically compelling form if it can be shown that Y 
has only one kn own cause (namely, X) or that X is a necessary condition of Y: 



X is necessary to the occurrence of Y. 
Y exists. 

Therefore, X existed. 



Thus, this form of argument is a logically valid form of historical inference in which a 
past cause or event can be established from the effect alone . 40 For instance, the 
archaeologist who knows that scribes are the only known cause of linguistic inscriptions 
will, upon discovering tablets containing ancient writing, infer scribal activity. The 
historical geologist who kn ows that a volcanic eruption is the only known cause of 
widespread deposits of volcanic ash will infer that a past eruption caused the ash layer 
discovered today. As British philosopher Walter Bryce Gallie noted, where a particular 
past cause is known to be necessary to produce a subsequent effect, the occurrence of 
“the effect is taken as sufficient to establish the occurrence of the cause .” 41 



Another Case 

But what do historical scientists do in the more difficult case where there is more than 
one possible cause — more than one causally adequate explanation — of the evidence in 
question? Scientists will look for additional evidence that can help discriminate the 
explanatory power of the remaining explanations. Ideally in such situations, scientists try 
to discover an additional fact or group of facts for which there is only one adequate 
causal explanation. In other words, if scientists investigating the past can’t find a single 
fact that can be explained uniquely well (by reference to the only cause known of that 
type of fact), they try to find a wider group of facts for which only one adequate 
explanation exists . 42 

In many such cases, of course, the investigators will have to work their way back to the 
actual cause one painstaking step at a time. For instance, both wind shear and 
compressor-blade failure could explain an airline crash, but the forensic investigator will 
want to know which one did or if the true cause lies elsewhere. If both causes can explain 
the brute fact of the crash, the investigator will need to seek additional evidence about 
some specific aspect of the crash that only one of the candidate explanations can 
explain — maybe some evidence from the flight recorder or at the crash scene. Ideally, the 
investigator will find an additional piece of evidence or suite of evidences for which there 
is only one known cause, allowing him or her to distinguish between competing 
explanations and to eliminate every explanation but the correct one. 

Here’s an idealized illustration that I developed with some colleagues to show how this 
process works . 43 Suppose I wake up from a nap and see that the driveway at my house is 
glistening with water. The car in the driveway is also wet. From these two facts, or pieces 
of evidence, what can I conclude? There are actually several “causally adequate” 
explanations. It might have rained while I was napping, or the automatic sprinklers may 
have come on, or someone may have sprayed the car and driveway with the hose. With 




only the data that the driveway and car are wet, all these explanations are credible 
possibilities. Each cause cited is known to be capable of producing the effect in question. 

How do I decide which of these “causally adequate” explanations best explains the data? 
Clearly, I must look for more evidence. As I do, I discover that the lawn and the street are 
perfectly dry and there isn’t a cloud in the sky. What should I think now? Although the 
sprinkler theory and the rainstorm theory are still possible, these explanations are now 
much less likely in the light of the new evidence. Rainstorms produce wet driveways, but 
usually not wet driveways surrounded by dry yards, streets, and houses. And cloudless 
skies aren’t particularly good candidates for producing rain. Thus, although the rainstorm 
and sprinkler hypotheses provide causally adequate explanations of the first group of 
facts, they fail to do so for the second. 

But now, suppose I look a little harder and see a bucket with soapy water and a sponge 
sitting behind the car. With this additional piece of data, the best explanation for the 
whole group of observations becomes obvious: someone probably washed the car and 
sprayed the driveway with the hose. The car-wash hypothesis now provides the only 
“causally adequate” explanation of the whole ensemble of facts. It does this by providing 
a (merely) adequate explanation of some facts (the wet car and the wet driveway) and a 
uniquely adequate explanation of others (the bucket of soapy water). 

This homespun example shows how we often use the method of inference to the best 
explanation to determine past causes of events when more than one cause can explain the 
same evidence. In my research, I came across many examples of historical scientists 
using precisely this strategy. 

Here’s a famous example from cosmology. During the late 1940s, two theories — the big 
bang and the steady state — could explain the main facts of cosmology. According to the 
big-bang theory, the universe had a definite beginning a finite time ago and has been 
expanding outward ever since as a result of its original explosive beginning. According to 
the steady-state theory, the universe is expanding, but it has existed eternally in a state of 
constant density. Space has been expanding eternally, and as it does, new matter is 
continuously created, thus maintaining the average density of matter in the universe. 
Thus, both theories could explain the evidence of an expanding universe, such as the “red 
shift” of light coming from distant receding galaxies. 

Faced with this situation, cosmologists sought to make additional observations to 
detennine which theory had superior explanatory power. When cosmic background 
radiation was discovered in 1965, it allowed cosmologists to decide this question. The 
big-bang theory postulated an early universe in which all matter was densely compacted, 
whereas the steady-state theory did not. If matter had existed in an early high-density 
state, it subsequently would have produced a ubiquitous background radiation that would 
have filled the universe as it expanded. Thus, the discovery of the background radiation 
helped to establish the superior explanatory power of the big-bang theory. In conjunction 
with other discriminating pieces of evidence, it also resulted in the rejection of the 




steady-state model. Despite an initial parity in explanatory power, the big bang ultimately 
provided a better, more causally adequate explanation of the full range of relevant 
evidence than did the steady state. 

These illustrations show that the process of determining the best explanation often 
involves generating a list of possible hypotheses, comparing their known (or theoretically 
plausible) causal powers against the relevant evidence, looking for additional facts if 
necessary, and then, like a detective, progressively eliminating potential but inadequate 
explanations until, finally, one causally adequate explanation for the ensemble of relevant 
evidence remains. As Scriven explained, such “retrospective causal analysis” “proceeds 
by the elimination of possible causes .” 44 



Causal Existence Criterion 

There is another way of thinking of the problem of reconstructing the past when there are 
several possible causes of the same effect. If there is more than one causally adequate 
explanation of the same evidence, then scientists need to establish which of the adequate 
causes was actually present and responsible for the event in question. For that reason, 
some philosophers of science have argued that “best” explanations of past events must 
meet two criteria: causal adequacy and causal existence. To meet this second condition of 
a “best” explanation, a scientist must show that a proposed cause is not only capable of 
producing the event in question, but that the cause was present — that it existed at the right 
time and place — so as to have had the opportunity to produce the event in question. As 
Michael Scriven insisted, historical scientists not only have to show that a postulated 
cause could have produced the effect in need of explanation; they need “to show... that 
this cause was present .” 45 

Scriven notes that, for any given effect or phenomenon, there might be “a list of possible 
causes,” each of which is known to be capable of producing the effect or event in 
question . 46 Citing one of the possible causes on such a list, however, would not 
necessarily explain the event, even if the cause is known to have the power to produce the 
same kind of event on other occasions. Why? Because the event cited as the cause (as 
opposed to another possible cause) may not have actually occurred. Thus, he states, “We 
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do not explain [a particular event]. . .merely by mentioning some possible cause.” 

An example will help to illustrate. In order to explain the extinction of the Irish elk, a 
biologist must do more than just mention events that could have caused the extinction 
(such as an invasion of predators or the pollution of the elk’s habitat). After all, these 
possible causal events might not have occurred. And obviously, if these events didn’t 
happen, they cannot have caused the extinction and won’t provide the best explanation of 
it. Instead, a biologist must also show that one of these events actually occurred, that 
predators actually invaded the elk’s environment or that an increase in pollution ruined 
the elk’s habitat. Scientists need to establish the past existence of the cause (in proximity 
to the effect) as well as the cause’s adequacy. 




So how do historical scientists establish that a given cause existed in the past? How can 
scientists meet this condition of causal existence? They have two ways of doing this, both 
of which I have already described. First, historical scientists can show that a presently 
acting cause must have been present in the past because the proposed candidate is the 
only known cause of the effect in question. If there is only one possible cause of a salient 
piece of evidence, then clearly the presence of that evidence establishes the past existence 
of its cause. Second, historical scientists can establish the existence of a cause by 
examining a wider class of facts to show that only one of the possible causes explains the 
whole ensemble (i.e., the main fact that needs explanation as well as other associated 
facts). If there is only one cause capable of producing all the relevant facts, then, again, 
that cause must have been present in the past in order to produce them. 

This latter strategy was used in the wet-driveway illustration. Since there were many 
possible causes by which the driveway could have gotten wet, determining which cause 
was actually responsible for the wet driveway required comparing the explanatory power 
of the various hypotheses against a group of associated facts. This evaluation established 
that only one of the otherwise adequate causes of the wet driveway also provided an 
adequate explanation of all the other relevant facts. Thus, it also established that someone 
had washed the car and that the causal factors implied by that hypothesis — a spraying 
hose and sloshing bucket — had been present in the past. Thus, the car-wash hypothesis 
met the causal existence criterion (and, having already met the causal-adequacy criterion, 
also qualified as the best explanation). 48 

Clearly, the way historical scientists go about meeting the causal-existence criterion 
involves further assessments of causal adequacy, so in practice the two criteria are closely 
related. Meeting one criterion (causal existence) depends upon how well a hypothesis 
meets the other (casual adequacy) — either with respect to a single salient fact or a wider 
ensemble of associated facts. For this reason, many historical scientists may not make 
any explicit mention (or even be aware) of seeking to meet a separate causal-existence 
criterion. Indeed, they need not do so, because they can ensure that this criterion gets met 
just by identifying a uniquely adequate causal explanation, either with respect to a single 
salient fact or with respect to a wider ensemble. If a single fact does not sufficiently 
discriminate the explanatory power of competing hypotheses, then historical scientists 
intuitively seek additional facts until they eliminate all but the best (most causally 
adequate) explanation. In so doing, they meet the causal-existence criterion and identify 
the best explanation at the same time. 



The Nature of Historical Science 

Even so, Michael Scriven describes the historical method of “retrospective causal 
analysis” as a simple three-part method of evaluating proposed causal explanations. 
Many historical scientists find it helpful to think of it this way. According to Scriven, in 
order to establish a causal claim, the historical scientist needs: (1) “evidence that his 
candidate [cause] was present” and (2) evidence that “it has on other occasions clearly 




demonstrated its capacity to produce an effect of the sort here under study.” Additionally, 
the historical scientist needs to establish that there is (3) an “ absence of evidence (despite 
a thorough search). . .of. . .other possible causes.” 49 

Many fields that seek to explain events by reference to past causes — such as forensic 
science, evolutionary biology, paleontology, geology, archeology, and cosmology — use 
this kind of “retrospective causal analysis.” 50 Indeed, many scholars think that Charles 
Darwin structured his case in the Origin to show that natural selection met both the 
causal-adequacy and causal-existence conditions of a best explanation 5 1 and, in so doing, 
that he was able to test his theory as well. 

According to Darwin, his theory of “universal common descent” could be tested not by 
using it to predict future outcomes under controlled experimental conditions, but by 
showing that it could explain already known facts in a better, a more causally adequate 
way than rival hypotheses. As he explained in a letter to Asa Gray: “I... test this 
hypothesis [universal common descent] by comparison with as many general and pretty 
well-established propositions [facts] as I can find.... And it seems to me that, supposing 
that such a hypothesis were to explain such general propositions, we ought, in accordance 
with the common way of following all sciences, to admit it till some better hypothesis be 
found out.” 52 

Taking Stock 

And so my study of how historical scientists reconstruct past causes led me to conclude 
that Charles Thaxton had been on to something. There does, indeed, seem to be a 
distinctive method of historical scientific reasoning and investigation. Historical scientists 
have a distinctive objective (to identify the causes of past events); they fonnulate 
distinctive types of explanation (in which they cite past events as causes); and they make 
inferences with a distinctive (abductive) logical fonn. 

But I concluded that Thaxton was mistaken about something as well. The historical 
sciences were not quite as unique as he thought. Historical scientists use a common 
method of evaluating their theories in which they evaluate the relative explanatory power 
of an inference to determine its strength, plausibility, or likelihood. This meant that 
historical scientific theories, like other scientific theories, were testable, albeit by 
reference to distinctively historical criteria for determining when an explanation qualifies 
as “best.” Historical scientists test their theories against the explanatory power of their 
competitors and against our knowledge of the “causal adequacy” or “causal powers” of 
various entities. They make these assessments based upon our present knowledge of 
established cause-and-effect relationships. The criteria of causal adequacy and to a lesser 
extent causal existence, developed by Lyell and used by Darwin, constitute critical 
experience-based tests of historical scientific theories, tests that can be used to 
discriminate the explanatory power (and merit) of competing hypotheses. 




Clearly, this method of testing scientific ideas is different from that used by experimental 
scientists, who test their theories by making predictions about what will happen under 
controlled laboratory conditions. Even so, historical scientists are not the only scientists 
to use it. Arguably, Watson and Crick used this method to test their ideas about the 
structure of DNA against competing models. And many scientists — theoretical physicists, 
biochemists, psychologists, astronomers, pathologists, medical diagnosticians — as well 
historians, detectives, and thinking people everywhere use this method of reasoning every 
day to make sense of their experiences. 

In any case, whether the historical sciences use an absolutely distinctive scientific method 
now seemed less important to me than understanding exactly how conclusions about past 
events and causes could be rigorously established. And my study of how historical 
scientists fonnulated their inferences and arguments gave me a clear understanding of 
this. Historical scientists evaluate the strength of competing abductive inferences by 
comparing their explanatory power. In the best of cases, they will infer a clearly best 
explanation — one that cites the “only known cause” of the effect or effects in question. 



Intelligent Design as a Possible Scientific Explanation? 

What did all this have to do with the DNA enigma? Quite simply this. I wondered if a 
case for an intelligent cause could be fonnulated and justified in the same way that 
historical scientists justify any other causal claim about an event in the past. What I 
learned about how historical scientists determine the causes of past events gave me 
another reason to think that intelligent design was at least a possible (historical) scientific 
explanation for the origin of biological infonnation. As conceived by advocates of the 
theory, the activity a conscious designing agent in the past constituted a causal event (or 
series of such events). Moreover, an “intelligent cause” might well have made a causal 
difference in outcome. The activity of a designing intelligence might account, for 
example, for the difference between a chaotic prebiotic environment in which simple 
chemical constituents interacted with each other in accord with natural entropic 
(randomness-producing) processes and an environment in which systems with highly 
specific arrangements of chemicals arose that stored information for building proteins 
and protein machines. Intelligent design could have made a causal difference between 
continuing chemical chaos and the origin of information-rich molecules. 

Moreover, my study of historical scientific reasoning also suggested to me that it was at 
least possible to formulate a rigorous case for intelligent design as an inference to the best 
explanation, specifically, as the best explanation for the origin of biological information. 
Not only was it was possible to conceive of the purposeful act (or repeated action) of an 
intelligent agent as a causal event; the action of a conscious and intelligent agent clearly 
represented both a known (presently acting) and adequate cause for the origin of 
information. Uniform and repeated experience affirms that intelligent agents produce 
information-rich systems, whether software programs, ancient inscriptions, or 
Shakespearean sonnets. Minds are clearly capable of generating specified information. 




Conclusion: The Causes Now in Operation 



When I first noticed the subtitle of Lyell’s book, “By Reference to Causes Now in 
Operation a light came on for me. I immediately asked myself: What causes now in 
operation produce digital code or specified information? Is there a known cause — a vera 
causa — of the origin of such information? What does our uniform experience tell us? As 
I thought about this further, it occurred to me that by Lyell and Darwin’s own rule of 
reasoning and by their test of a sound scientific explanation, intelligent design must 
qualify as, at least, a possible scientific explanation for the origin of biological 
information. Why? Because we have independent evidence — “uniform experience” — that 
intelligent agents are capable of producing specified infonnation. Intelligent activity is 
known to produce the effect in question. The “creation of new infonnation is habitually 
associated with conscious activity.” 

But is intelligent design the “only known cause” of the origin of specified infonnation? I 
was now more than intrigued with this possibility, especially given the implications for 
the status of the design argument in biology. If intelligent design turned out to be the only 
known or adequate cause of the origin of specified information, then the past action of a 
designing intelligence could be established on the basis of the strongest and most 
logically compelling form of historical inference — an inference from the effect in 
question (specified information) to a single necessary cause of that effect (intelligent 
activity). Moreover, if intelligent design were shown to be the only known cause, if it 
were shown to be a uniquely adequate cause, then intelligent design would also 
automatically meet the causal-existence criterion of a best explanation as well. 

Is intelligent design in fact the best explanation for the origin of life? I knew that the 
central question facing scientists trying to explain the origin of the first life was precisely: 
How did the sequence-specific digital infonnation necessary to building the first cell 
arise ? 54 At the same time, I knew that origin-of-life researchers had proposed many 
explanations for the origin of biological information, including some new and potentially 
promising approaches. Though I had studied both Oparin’s classical theory of chemical 
evolution and several contemporary alternatives, much of my research had focused on 
methodological issues — on how historical scientists reason about past events and causes. 
These studies convinced me that intelligent design was a possible — a causally 
adequate — explanation for the origin of biological information. But to determine whether 
intelligent design was the best — the only causally adequate explanation — I would need to 
know more about other scientific possibilities. I would need to follow Scriven’s directive 
to “make a thorough search” for and evaluation of other possible causes. Over the next 
several years, as I assumed my duties as an assistant professor, I set out to do exactly that. 
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Chance Elimination and Pattern Recognition 



My work in England on historical scientific reasoning convinced me that an intelligent 
cause could function as a possible scientific explanation for the origin of biological 
information. But I wondered if it could provide a better explanation than the alternatives. 
Could it be inferred as the best explanation for the origin of the specified infonnation in 
the cell? To answer this question, I would need to undertake a rigorous examination of 
the various alternatives. 

By the time I finished my Ph.D. studies, in 1990, I was familiar with most of the main 
theories then current for explaining the origin of life. These theories exemplified a few 
basic strategies of explanation. Some relied heavily on chance — that is, on random 
processes or events. Some invoked lawlike processes — deterministic chemical reactions 
or forces of attraction. Other models combined these two approaches. In this respect, 
research on the origin of life followed a well-established pattern in the sciences, one 
explicitly recommended by the leading scientists of our time. 

During the period of explosive discovery from 1953 through 1965 now known as the 
molecular biological revolution, one of Francis Crick’s colleagues was the French 
biologist Jacques Monod. In 1971 he wrote an influential book called Chance and 
Necessity, extolling the powers of chance variation and lawlike processes of necessity in 
the history of life. 1 As an example of a chance process, Monod noted how random 
mutations in the genetic makeup of organisms can explain how variations in populations 
arise during the process of biological evolution. In Monod’s shorthand, “chance” referred 
to events or processes that produce a range of possible outcomes, each with some 
probability of occurring. The term “necessity,” on the other hand, referred to processes or 
forces that produce a specific outcome with perfect regularity, so that the outcome is 
“necessary” or inevitable once some prior conditions have been established. Since, for 
example, Newton’s law of gravity stipulates that “all unsuspended bodies fall,” scientists 
say that any unsuspended body will “of necessity” fall to the earth. If enough heat is 
applied under a pan of water, the water will eventually — of necessity — boil. 

In Chance and Necessity, Monod did not try to explain the origin of biological 
information or the cell’s system for processing information. Instead, he presupposed this 
system and then argued, following Darwin, that the subsequent history of life could be 
explained solely by reference to the twin factors of chance and necessity. For scientists 
reluctant to consider intelligent design as an explanation for the origin of life, Monod’s 
work codified a normative set of guidelines for approaching this and every other 
scientific problem. His message was clear: scientists can, and should, explain all 



phenomena by reference to chance, necessity, or the combination of the two. And, 
indeed, most scientists have sought to explain the origin of life using one of these 
approaches. 

Following Monod’s order of presentation and the rough sequence in which ideas about 
the origin of biological infonnation appeared after 1953, I began to examine scenarios 
that relied primarily on chance. Could the random interaction of the molecular building 
blocks of life explain the mysterious origin of the biological information needed to 
produce life? Various scientists have proposed this idea, but many origin-of-life 
researchers have expressed reservations about it, because they thought that it amounted to 
little more than saying, “We don’t know what caused life to arise.” And, indeed, as I 
scrutinized the chance-based explanations for life’s origin, I often found them 
disappointing. Specific chance-based proposals were few and far between. And where 
present, they were hopelessly vague. Upon reading them, my reaction was often the 
same: “That’s it?” 

This situation posed a dilemma for me in my investigation of the DNA enigma. Though 
chance was assumed to have played a significant role in the origin of life, few scientists 
could say exactly what that meant or offer any rigorous criteria for evaluating such 
explanations. Is “chance” a cause that can be cited to explain something? Is chance even 
a “something” that can cause anything? Chance-based explanations were, as a rule, so 
thin on detail that it was difficult to assess them. 



What Is Chance? 

My concerns about chance as an explanation boiled down to a few basic issues. What 
does it mean to say that something happened by chance? When is it reasonable to invoke 
chance in an explanation? And what justifies excluding such an explanation from 
consideration? Considering a couple of different situations clarified my thinking about 
these questions. 

In the Pacific Northwest, a now infamous bridge called the Tacoma Narrows (nicknamed 
Galloping Gertie) collapsed into the Puget Sound in 1940 after the bridge began to 
undulate more and more violently during a stiff, but not unusually high, wind. Now 
imagine that the engineers investigating the collapse had come back and, without citing 
any other factors, told the political authorities that the bridge had collapsed “by chance.” 
Would the authorities have found that explanation satisfying or informative? 

Consider a second scenario. A man named Slick has just won at a Las Vegas roulette 
table where the odds of winning are 1 in 38. Although Slick’s name does not exactly 
inspire confidence, the croupier at the game table certifies that he did not cheat and the 
roulette wheel is operating properly. The victory was legitimate. Slick is paid in accord 
with his wager. Why did he win? What explains the ball falling into the pocket 
corresponding to Slick’s bet? The croupier spun the wheel and the ball fell — by 
chance — into the pocket that Slick had chosen. 




A bridge collapsed “by chance.” A gambler won at the roulette table “by chance.” Which 
of these two explanations seems more satisfactory or reasonable? Clearly the second, and 
not the first. But why? 

In the second case, it seems legitimate to attribute the event to chance, because there is a 
known process, namely, the spinning roulette wheel with its 38 compartments, which can 
generate the event of interest — the ball landing on a red 16 — with some regular or 
predictable frequency. To say that this event occurred by chance does not mean that 
chance caused something to happen, but rather that, given the construction of the roulette 
wheel, there is a definite probability — a 1 in 38 chance, to be precise — that the ball will 
fall in the red 16 pocket. Absent indicators of other influences at work, we say that the 
ball fell in the red 16 pocket “by chance” to indicate that we think this event occurred as 
one of the normal possible outcomes of a regular underlying process. 

The case of the investigating engineers, however, is different. They did not specify an 
underlying process capable of producing bridge failure as a possible outcome. They cited 
chance purely to conceal what they did not know about the cause of the bridge failure. 
Any official receiving this answer would have correctly regarded it as an admission of 
ignorance, a fancy way of saying, “We don’t know what happened” or “We can’t explain 
it.” 

There’s a second aspect to this, however. When scientists say that something happened 
by chance, they do not usually mean to deny that something caused the event in question. 
(Some interpretations of quantum physics would stand as an exception.) Instead, they 
usually mean that the event in question occurred because of a combination of factors so 
complex that it would have been impossible to determine the exact ones responsible for 
what happened or to have predicted it. Imagine that I roll a die and it turns up a five. That 
outcome was caused or determined by a number of factors — the exact force applied to the 
die, the orientation of the die as it left my hand, the angle at which it first hit the table, 
and so on. Saying that the five turned up “by chance” does not deny that the event was 
detennined or caused by physical factors, but instead that the exact combination of 
physical factors was so complex that we cannot know them exhaustively and, therefore, 
could not have predicted the precise outcome. 

Thus, when scientists say that something happened by chance, they do not usually mean 
that some entity called “chance” caused something to happen. Instead, they mean that 
there is a process in play that produces a range of outcomes each of which has a chance 
or probability of occurring, including the outcome in question. 



Chance as Negation 

But there is usually more to the notion of chance than that. When scientists attribute an 
event to chance, they also usually mean there is no good reason to think the event 




happened either by design or because of a known physical process that must, of necessity, 
generate only one possible outcome. 

Imagine that a statistician is trying to determine whether a coin is fair by observing the 
distribution of heads and tails that result from flipping it. The “chance” hypothesis in this 
case is that the coin is fair. The alternative hypothesis is that the coin is biased. If the 
distribution of heads and tails comes up roughly even, the statistician will conclude that 
the coin is fair and elect the “chance hypothesis.” In saying this, the statistician is not 
saying that “chance” caused the 50-50 distribution or even that she knows what caused 
any particular outcome. Instead, the affirmation of the chance hypothesis mainly serves to 
negate the alternative hypothesis of bias (in this case, a loaded coin produced by design). 
The essentially negative character of the chance hypothesis is suggested by its other 
common name: the null hypothesis (i.e., the hypothesis to be nullified or refuted by 
alternative hypotheses of design or law like necessity). 

Such negations, implicit or explicit, are part of what give substantive chance hypotheses 
content. To say that the roulette ball fell in the red 16 pocket by chance is also to say that 
the ball was not placed there intentionally and that there was nothing about the 
construction of the roulette wheel that forced the ball to land in the red 16 of necessity. 
On the other hand, the engineers who appeal to chance because they don’t know anything 
about the cause of a bridge collapse don’t make such a negation. Because they know 
nothing about the cause, they are unable to rule out alternatives, whether foul play 
(design) or some underlying structural failure induced by environmental conditions acting 
on a structural defect (necessity). Vacuous appeals to chance neither affirm a cause nor 
negate one. But substantive appeals to chance specify the operation of a relevant 
outcome -producing process, and they also implicitly or explicitly negate (or nullify) other 
possible types of hypotheses. 

William Dembski, R. A. Fisher, and Chance Elimination 

Of course, just because a chance hypothesis avoids being vacuous by making definite 
claims does not necessarily mean that its claims are true. Successful appeals to chance 
must be justified, including their implicit negations of design and necessity. So how do 
statisticians and scientists test substantive (rather than vacuous) chance hypotheses? How 
do they decide when to accept and when to reject chance as an explanation? 

About the time I was asking myself these questions, I met William Dembski, a 
mathematician who later would become one of the best-known proponents of intelligent 
design. In the summer of 1992, I returned to Cambridge to work on a research project 
with Dembski and another colleague, Paul Nelson, who was then working toward a Ph.D. 
in the philosophy of biology at the University of Chicago. 

During the preceding year Nelson and I had been approached by a research institute about 
the feasibility of developing a rigorous theory of intelligent design. The institute wanted 
to support a small interdisciplinary team and invited a proposal. Nelson and I thought it 




would be desirable to find someone with a background in probability and information 
theory to round out our team. A week before the proposal was due Nelson stumbled 
across two articles by a young mathematician with a recently minted Ph.D. from the 
University of Chicago. The articles were impressive. One broke important new ground in 
understanding pattern recognition. The other described how probability theory could be 
used to assess chance hypotheses. At the end of one of the articles was a biosketch 
describing Dembski’s considerable qualifications. In one line it noted that Dembski 
wanted to use his expertise in probability theory to investigate the feasibility of the design 
hypothesis. We had found our man, or so we thought. 




Figure 8.1. William Dembski. Printed by permission from William Dembski. 



Unfortunately, despite concerted effort, Nelson could not locate Dembski, who was 
traveling in Germany. Then, the day before the deadline, I received a call from a friend in 
Texas who wanted to tell me about someone he thought I should know. Sure enough, he 
wanted to tell me about Bill Dembski. He described Dembski as unusually brilliant, yet 
unassuming and pleasantly eccentric. He also confirmed that Dembski definitely shared 
our interests. Upon hearing of our dilemma, he encouraged me to “pencil him in” on the 
proposal and make explanations later. We did. After receiving word that our proposal had 
been accepted — contingent on the willingness of all investigators to participate — Nelson 
and I finally located Dembski and told him what we had done. He was surprised, but 
delighted by our audacity. 

My first meetings with Dembski in Cambridge confirmed my friend’s assessment. 
Dembski had an extraordinary facility with logical and mathematical formalism and a 
desire to use these tools to address real scientific questions. He also exhibited some rather 
endearing eccentricities. He had prepacked a supply of health food for the summer in 
order to avoid the heavy fare in the Cambridge colleges. He also had an odd penchant for 
answering questions with illustrations involving coins and dice without explaining what it 
was he was using these examples to illustrate. By summer’s end, I had enticed him to join 



me for “beans and chips” in college and to begin his answers with straightforward 
assertions before bringing out the dice and coins to illustrate his point. He, in turn, 
introduced me to a way of assessing chance as an explanation for the origin of life. 

During the summer, Dembski, Nelson, and I talked frequently about the problem of 
assessing chance as an explanation. Dembski introduced us to the work of the pioneering 
statistician Ronald A. Fisher. During the 1920s, Fisher had developed a method of 
statistical hypothesis testing that brought greater rigor to discussions of chance. On the 
one hand, Fisher’s method acknowledged that some phenomena could be reasonably 
explained by, or at least attributed to, random processes or chance events. On the other 
hand, Fisher’s work established a clear method for deciding when chance alone could be 
eliminated from consideration as the best explanation. I was intrigued. 

Here’s an example that shows how Fisher’s method of chance elimination works. 
Apparently encouraged by his success at the casino, Slick returns the next day to try his 
hand at the roulette wheel again. The croupier smiles to himself. This is how casinos 
make their money back from the previous day’s big winners — overconfidence. But then a 
most unusual morning unfolds. Over the next few hours, a preternaturally confident Slick 
bets only on the red 16. He wins not just once at the roulette wheel, but 100 times in a 
row (fortunately for the casino, he keeps betting only small amounts, thus not breaking 
the bank). 

The croupier knows, based upon the construction of the wheel, that the probability of the 
ball falling into a given pocket is exactly 1 in 38. Thus, the probability of the ball falling 
into a given pocket 100 times consecutively is extremely small, 1 chance in 38 100 , or 1 
chance in 1 (j In this situation, the casino rightly suspects (and long before the 100th 
spin) that something more than chance is at work. Either the roulette wheel has a 
mechanical defect that causes the ball to catch and stop in the same place every time or 
someone is cheating in some way that has not yet been detected. Either way, given their 
knowledge of the odds in roulette and the construction of the wheel, the casino’s 
managers will conclude that Lady Luck is not the culprit. But why? 

On average the ball should land in the red 16 pocket once every 38 spins. (This is the 
expected waiting time between successes). 4 Dumb luck might allow it to land there two 
or three times in 100 spins, but since it has landed in the winning pocket in every one of 
the last 100 spins, this will excite concern. This event cannot reasonably be attributed to 
the nonnal function of the roulette wheel, because the frequency of the occurrence of the 
event in question has deviated too much from the frequency that the house rightly 
expects. Thus, “chance” — even chance understood in the more substantive way defined 
above — no longer seems an appropriate explanation. 

According to Fisher, chance hypotheses can be eliminated precisely when a series of 
events occurs that deviates too greatly from an expected statistical distribution of events 
based on what we know about the processes that generate those events or what we know 
from sampling about how frequently those events typically occur. Fisher developed a 




statistical method for identifying these variations from expectation. That method depends 
crucially on what he calls a “rejection region.” 5 

Fisher knew that any truly random process would produce a range of events with some 
predictable distribution. In the case of roulette, gamblers will win the game about once 
every 38 times. A few will win less frequently than that; some will win more frequently. 
A very few will win either quite frequently or rarefy. These frequencies can be plotted on 
a curve that shows the frequency of winning on one axis (the x axis) and the number of 
players who achieve a given frequency on the other (the y axis). The resulting distribution 
or pattern is the familiar “bell curve” kn own to students of statistics everywhere (see Fig. 
8.2). Events that conform to our expectations about what should happen fall near the 
center of the bell curve. Events that deviate from expectation fall on the sides of the curve 
or on the tails. Events that deviate dramatically from statistical expectation — like Slick 
winning at roulette 100 times in a row — will fall way out on the tails of the curve, often 
so far out that we have no other examples of such events occurring, at least not by chance 
alone. 

And this was Fisher’s insight. He argued that statisticians could “catch” or “identify” 
events that resulted from factors other than chance by prespecifying a rejection region. A 
rejection region is a place on a bell curve or some similar statistical plot that defines a set 
of outcomes that deviate dramatically from our statistical expectation — so dramatically, 
in fact, that statisticians think they can reasonably conclude that these events did not 
occur from the normal outworking of the statistical process in play. For example, a 
statistician working for a casino may specify as a rejection region any statistically 
significant set of outcomes in which a roulette player wins more than 50 percent of the 
time. When an event occurs that falls within that rejection region — like Slick winning 
100 times in a row — the statistician will reject the chance hypothesis. Further, by 
specifying in advance where those rejection regions lie, Fisher thought statisticians could 
guard against reading patterns or significance into a statistical distribution that wasn’t 
there. Prespecifying rejection regions could prevent “cherry-picking” the data. 6 




Figure 8.2. A bell-curve distribution with a prespecified rejection region shaded. 



Fisher’s method made intuitive sense to me, and I wondered if it could shed light on the 
question of whether random processes were responsible for the origin of biological 
information. But Dembski had other questions on his mind. He was wondering why 
Fisher’s method worked in light of a puzzling feature of probabilistic reasoning. As a 
probability theorist, Dembski knew that intuitively we tend to reject chance as an 
explanation for extremely improbable events. Fisher’s method of chance elimination 
illustrated this. Clearly, the improbability of Slick’s winning every time contributes to our 
skepticism about the chance hypothesis. But there was something puzzling about this 
intuitive conviction as well — something that Dembski pointed out to me. Extremely 
improbable events — events that don’t necessarily indicate anything other than 
chance — happen all the time. Indeed, many extremely improbable events don’t seem to 
justify skepticism about chance at all. 

Dembski illustrated this by asking me to imagine flipping a coin 100 times and then 
writing down the exact sequence of heads and tails that turned up. He pointed out that if I 
was bored enough to do this, I would actually participate in an incredibly unlikely event. 
The precise sequence that occurred would have a probability of 1 chance in 2 100 (or 
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approximately 1 in 10 ). Yet the improbability of this event did not mean that something 
other than chance was responsible. After all, some sequence of that improbability had to 
happen. Why not this one? 

This same puzzle is evident in my example of Slick at the roulette wheel. When Slick 
won roulette 100 times in a row, he participated in an incredibly improbable event. The 
odds of his winning 100 times in a row were incredibly low, 1 chance in 10 158 . (There 
are only 10 80 elementary particles in the observable universe.) Now imagine that Slick 
has not won every time, but instead only two or three times out of 100, more in keeping 
with the expected frequency for winning by chance. Imagine also that each time the ball 
falls in a pocket, Slick records the outcome in hopes of finding a pattern to guide his 
future betting. After 100 spins, Slick has compiled a series of 100 outcomes — red 10, 
black 17, red 28, black 5, and so on. 

What are the odds of this exact series of 100 outcomes occurring? They are exactly the 
same as the odds of Slick winning 100 times in a row. Since the probability of the ball 
falling in any one of the pockets on any given spin is the same each time (1 in 38), any 
specific combination of 100 outcomes will be the same, 1 chance in 38 100 , or 1 chance in 
10 . Even in defeat, Slick will witness an incredibly improbable event. 

The Improbability Paradox 

Here we have a paradox. On the one hand, the improbability of Slick’s winning 100 times 
in a row seemed to be an important factor in determining that something other than 
chance was at work. After all, had Slick won only once or twice in a row — something 
much less improbable — no one would have suspected anything. On the other hand, it was 




easy to identify another event just as improbable as his winning 100 times in a row that 
would not trigger the same recognition. 

The essence of this paradox is that two equally improbable events nevertheless seem as 
though they should be explained in two different ways. This paradox led Dembski to 
think that assessments of probability alone could not be the whole reason for rejecting 
chance. It also led him to ask a series of important questions. 

What, in addition to improbability, justifies excluding chance as an explanation for an 
improbable event? What was Fisher’s method identifying, other than just an improbable 
event, that made it a reliable method for eliminating chance — for catching “something 
else going on”? Clearly, Slick’s winning 100 times in a row aroused justifiable suspicion 
of cheating, whereas his winning only twice out of 100 did not. What was his method 
identifying in one case, but not in the other? 

As Dembski thought about this, he realized that Fisher was introducing another factor 
into his analysis. Fisher’s method was not just analyzing an improbable event (or 
sequence of outcomes); it was looking for a significant pattern within that larger event (or 
sequence). Fisher’s method compared the frequency of occurrence of certain kinds of 
outcomes within a larger event to a statistical norm. For example, it compared the 
frequency of Slick’s winning bets to the statistical norm for roulette players. Dembski 
realized that by defining certain kinds of outcomes, the frequency of winning, for 
example, as significant, Fisher had introduced an extra-probabilistic factor into his 
analysis. In other words, Fisher’s method required statisticians to deem certain kinds of 
events as significant. But Dembski realized that “a significant kind of event” is the same 
thing as a pattern. 

Events at the roulette wheel again prove instructive. Consider two kinds of 
outcomes — two patterns — that would lead statisticians to reject a chance hypothesis. 
First, while testing a new roulette table, the croupier at the gaming house finds that the 
ball repeatedly falls in the same pocket. Yet on the chance hypothesis that the wheel is 
working properly and there has been no outside interference, the odds of the ball 
repeatedly falling in exactly the same pocket — say, 100 times in a row — are very low. 
The croupier suspects that something beyond chance is in play, but again, not just 
because the probability of the observed outcome is incredibly small. Clearly, in this case, 
a pattern has emerged. The same physical event keeps recurring, though not one that has 
any effect on a game, since the roulette wheel is being tested only at the moment. What 
should the croupier conclude? The recurrence of the exact same physical outcome 
suggests to him some physical defect with the wheel. He investigates. Sure enough, there 
is an inordinately high lip on the backside of the red 16 pocket. Each time the ball rolls 
over the red 16 it collides with the lip and falls — by necessity, not chance — into the same 
pocket. 

Now consider a second case in which a different kind of pattern triggers the rejection of 
the chance hypothesis. In this case, Slick repeatedly changes his bet, but nevertheless 




always wins. The exact sequence of bets manifests no obvious physical pattern (such as 
the ball repeatedly falling in the red 16). Instead, a different kind of pattern emerges. In 
this case, a seemingly random sequence of different physical outcomes produces a 
significant outcome in the game. A specific sequence of physical events combines to 
achieve a significant objective — Slick winning a lot of money. Unlike the previous case, 
where the pattern was evident simply by observing the event itself, the pattern in this case 
corresponds to an improbable series of events that has significance beyond the physical 
boundaries of the roulette wheel, a significance based on other things the croupier or 
statistician knows (i.e., the object and rules of the game of roulette). 

In this case, the statistician or croupier will again reject the chance hypothesis (that Slick 
won fairly and the wheel is working properly) and for a good reason. Once again an 
outcome has occurred that would be — if nothing but chance were at work — incredibly 
improbable. But beyond that the statistician also recognizes in that outcome a pattern of 
events that advances a goal or performs a function, namely, winning the game and 
money. Rejection regions often implicitly identify patterns of events that advance some 
goal or perform some function. Thus, the rejection region that a statistician uses to 
eliminate chance in this case will not only correspond to an improbable sequence of 
events, but also to an improbable sequence of victories in the game. It identifies a 
frequency of winning that deviates dramatically from expectation. And since winning is a 
significant objective that agents routinely seek to achieve, a highly improbable string of 
winning bets suggests the activity of agents. It suggests cheating, not chance. Conversely, 
the gaming commission might suspect that the casino was cheating if the roulette wheel 
inflicted a disproportionately high rate of losses on customers, a rate consistently and 
significantly higher than the roulette wheel would generate if it were governed purely by 
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chance. 

Thus, Dembski realized that when we use a statistical plot to track the frequency of some 
event occurring, we are implicitly looking for a kind of pattern, a pattern of events that 
suggests something other than chance alone. A pattern of events may present itself as 
interesting because some physical state of affairs occurs repeatedly. Or a pattern of events 
may appear interesting because it has some independent functional significance apart 
from the physical features of the events. Rejection regions sometimes identify repeating 
patterns; other times they correspond to what I call “functionally significant patterns.” 
Either way, Dembski realized that implicit in Fisher’s method of defining rejection 
regions around “a frequency of occurrence that deviates dramatically from expectation” 
was the identification of specific kinds of events — events that manifested significant or 
salient patterns. 

The role of pattern recognition is even easier to see in cases where we reject the chance 
hypothesis without the use of a formal statistical method such as Fisher’s. Consider 
another example that illustrates the importance of pattern recognition to the elimination 
of chance. 




During my first year as a college professor, I was assigned to help teach a team-taught 
course on the history of Western philosophy. During a grading session with the other 
professors, my senior colleagues read samples of various student papers aloud to 
showcase both model papers and typical student errors. As I listened, I had a sudden 
recognition. “Hey,” I said, “I’ve read that paper.” One of the other profs chuckled and 
said, “I know, they all sound the same after a while.” But I persisted. “No,” I said, “I 
mean, I’ve read that exact paper, really.” As my colleague continued to read portions of 
this student essay on Plato, I rifled through my stack. About halfway down the pile, I saw 
the line he was reading and I finished his sentence for him. He looked up at me and read 
some more. I finished his sentence again. An antiphonal reading ensued. Groans arose 
from the teaching assistants and the other professors on the team. The two papers were 
exactly the same. The students were eventually confronted. After some emotion and 
vague denials, one of the two students confessed. It was exactly what it 
seemed — plagiarism. 

After meeting Dembski, I realized that my close encounter with a student plagiarist aptly 
illustrated his ideas about why we eliminate chance hypotheses — in this case, the 
hypothesis that the two students had just happened to write the same three typed pages of 
the same English characters in the same sequence without one having copied the other or 
both having copied a common source. 

Why did my colleagues and I eliminate the chance hypothesis in this case? Clearly, the 
odds did have something to do with it. A typical three-page double-spaced college essay 
has about 1,000 precisely sequenced English words, about 6,000 characters and spaces. 
The probability of one student generating the exact same sequence of 1,000 words 
independently, even if writing on the same topic, was almost incalculably small. Our 
awareness of the incredible improbability of this event occurring by chance contributed to 
our judgment that plagiarism was involved. After all, if the students had just replicated 
two or three short words or a short phrase, we would have had lesser grounds for 
suspicion, no doubt because the probability of generating a few short repeats was much, 
much higher. 

Even so, my discussions with Dembski made me aware that the odds weren’t the whole 
of the matter in such cases. The odds of getting any specific sequence of 6,000 characters 
is roughly the same as the odds of getting a sequence of 6,000 characters that exactly 
matches the sequence of 6,000 characters in another text. So what was the difference? In 
one case an improbable event matched another, and in the other case the event stood 
alone — that is, no such match occurred. 

And that was Dembski’s key insight. It wasn’t the improbability of an event alone that 
justified the elimination of chance. It was the match between one event and a pattern that 
we recognize from another. In detecting plagiarism, I had recognized not only an event I 
knew intuitively to be incredibly improbable (a specific sequence of English words and 
characters), but a pattern in that event that I knew from a completely separate event, 
namely, my reading of another student’s essay. It was the match, the convergence of an 




improbable event with another pattern of letters and words that made me aware of 
plagiarism. Pattern recognition helped trigger the rejection of chance. 



Dembski later developed his ideas about pattern recognition in more detail. He argued 
that some kinds of patterns we recognize can, in conjunction with an improbable event, 
reliably indicate the activity of a designing intelligence. Others types of patterns — highly 
repetitive patterns of the same physical outcome — such as a coin turning up heads 100 
times in a row might indicate the presence of some physical cause responsible for 
producing that outcome by necessity. In still other cases, we fabricate or imagine patterns 
rather than recognize them, and when this occurs the patterns may indicate nothing at all. 
(In Chapter 16, I discuss how Dembski developed a test to identify and exclude such 
phony patterns from consideration.) In any case, I came to realize, based on my 
discussions with Dembski, that pattern recognition played a key role in the rejection of 
chance. 

I also realized that this made sense in light of my own earlier philosophical reflections 
about what scientists mean by “chance.” If a substantive chance hypothesis necessarily 
negates or nullifies explanations involving physical-chemical necessity and design, then 
the presence of a pattern necessarily negates chance. The chance hypothesis in effect 
says, “There is nothing going on in this event to indicate any regular or discernable causal 
factors.” Since patterns signal the presence of deeper causal factors or regularities at 
work, the presence of patterns negates chance. Because patterns negate the null 
hypothesis (that “nothing is going on”) and the null hypothesis is the chance hypothesis, 
patterns negate the chance hypothesis. Patterns negate the negation — the negation 
entailed in a substantive chance hypothesis. 



Ex Post Facto Pattern Recognition and Singular Events 

As Dembski was thinking about the role that pattern recognition plays in chance 
elimination — and the way rejection regions define significant patterns — he had another 
key insight. He realized that it wasn’t necessary to anticipate or prespecify a pattern (or 
rejection region) in order to justify rejecting chance, as Fisher’s method required. 
Dembski regarded Fisher’s condition as too restrictive, since he could think of many 
examples in which recognizing a pattern after the fact, rather than specifying a rejection 
region before it, obviously justified the elimination of the chance hypothesis. 

For example, Dembski cites the case of a spy listening to and recording the 
communication of an adversary in wartime. Before an encoded message is decrypted, the 
spy can’t tell whether the combinations of sounds are occurring by chance. Nor can he 
specify in advance the exact combination of sounds that would induce him to reject the 
chance hypothesis — the hypothesis that he has been listening to random noise or static. 
Once the spy uncovers the cryptographic key and is able to decode the signals as 
meaningful text, however, he realizes — after the fact — that his intercepted signals were 
not random. As Dembski notes, “In contrast to statistics, which always identifies its 
patterns before an experiment is performed, cryptanalysis must discover its patterns after 
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the fact.” This same phenomenon occurs daily for most people as they reflexively reject 
chance as an explanation for patterns they perceive in events that have already happened, 
whether a mechanical device or a pattern evident in natural objects such as crystals or 
sand dunes. 

As the result of his deliberations, Dembski developed a method of chance elimination 
that made the importance of pattern recognition more explicit than it had been in Fisher’s. 
Just as historical scientists can often successfully explain events after the fact that they 
could not have predicted before the fact, Dembski argued that it was often possible to 
discern a pattern after the fact that could not have been specified beforehand (i.e., that 
was not prespecified). This allowed historical scientists to use his method to evaluate 
chance explanations of past events — events that had already occurred and would not 
recur. Since the origin of life constituted precisely such an event, his method could be 
used to evaluate the chance hypotheses for the origin of life and biological infonnation. 

Moreover, as I explain in the next chapter, although the scientists working on the origin 
of life rarely articulate specific criteria for evaluating chance hypotheses, they typically 
employ (if implicitly) Dembski’s criteria in assessing chance explanations for the origin 
of life. Clearly, Dembski was on to something. 



One Final Factor: Probabilistic Resources 

It was now clear to me that the occurrence of an improbable event alone does not justify 
eliminating the chance hypothesis. Even so, it was also clear that judgments about 
probability did play an important role in our reasoning about whether something beyond 
chance is at work. My colleagues and I would not have accused our students of cheating 
had they written two essays that just happened to use a few of the same phrases. Nor 
would a gambler suspect that a roulette table had a physical defect just because the ball 
landed in the same pocket twice in a row. Even though patterns are recognizable in these 
events, the events in question are simply not improbable enough to convince us of 
wrongdoing. 

Pattern recognition can lead us to suspect that something more than chance is at work. 
But the presence of a pattern alone does not justify rejecting chance, any more than an 
improbable event alone, in the absence of a pattern, justifies chance elimination. Instead, 
both the occurrence of an improbable event and the recognition of a pattern are 
necessary. Or to make the point using a logical distinction: the presence of a pattern is a 
necessary, but not by itself a sufficient condition of chance elimination. Instead, a 
quantitative judgment of improbability is also involved. 

But this raises another question. How improbable does an event have to be to justify the 
elimination of a chance hypothesis? If we begin to detect a pattern in an improbable 
sequence of events, at what point should our doubts about the chance hypothesis lead us 
to reject it as untenable or unreasonable? As a blackjack player repeatedly wins against 
the house, or a ball repeatedly falls in the same hole, or as a die repeatedly comes up on 




the same side, or as we observe an event that implies that great odds were overcome in 
the past — at what point do we finally conclude that enough is enough, that something 
besides chance must be in play? How improbable is too improbable? 

The answer is, “That depends.” It depends on the improbability of the event in question. 
But it also depends upon how many opportunities there are to generate the event. Here 
again my discussions with Dembski helped me think this through. He noted that there 
was another common fallacy in reasoning about chance and probability. He had already 
shown me that some people overlook the importance of pattern recognition in our 
reasoning about chance. He now explained that in the case where an improbable event 
manifests a pattern, there is another consideration. What matters, he argued, is not the 
probability of a particular event or outcome occurring on one occasion (or in one trial), 
but the probability of generating that event given all the opportunities (or trials) that it has 
to occur. As Christian de Duve, a Nobel Prize-winning biochemist who has written 
several books about the origin of life, explains, “What counts [in assessing the probability 
of a chance hypothesis] is the number of opportunities provided for a given event to 
happen, relative to the event’s probability.” 9 

Let’s go back to the roulette table to illustrate this point. It is moderately improbable that 
the roulette ball will land in the red 16 pocket twice in a row. In two spins of the wheel 
there are 38 x 38, or 1,444, possible outcomes. Thus, there is just one chance in 1,444 of 
the ball landing on the red 16 twice in a row. Nevertheless, if the croupier has 1,444 
opportunities to spin the wheel in a week, he will quite likely witness this otherwise 
unlikely event sometime during that time. Though the probability of getting two red 16s 
in just two spins of the roulette wheel is small, the probability of the event occurring by 
chance within the week is much higher. The croupier has what Dembski calls the 
“probabilistic resources” — the time and number of trials — necessary to render this event 
probable. 

Dembski uses a coin-tossing scenario to illustrate the concept of probabilistic resources 
and to point up the human tendency to overestimate the power of our probabilistic 
resources in overcoming astronomically long odds. He tells a story of a man who had 
been arrested and convicted for the crime of running a crooked gambling operation. The 
convict is brought before a judge for sentencing. The judge, a gambling man himself, 
decides to offer the convict a choice. The judge offers the criminal either ten years in 
prison or a tenn in prison that is determined by the outcome of a game of chance. 
Specifically, the judge tells the criminal that he can leave prison as soon as he flips a coin 
and it turns up heads 100 times in a row — with no tails interrupting the sequence. The 
judge stipulates that the coin must be fair, biased toward neither heads nor tails. Dembski 
asks, “What’s the smart criminal to do?” Should he take the coin- flipping option or the 
ten years? 

Here the criminal’s intuition might lead him astray. He might think, “Even if it takes two 
or three years, I’ll be better off than sitting in prison for a guaranteed ten years.” But 
imagine that before he takes the coin option, his accountant nudges him from the second 




row, hands him a calculator, and, whispering in his ear, reminds him that the judge is one 
of the people he fleeced with his crooked casino. The criminal stares at the calculator, 
perplexed. What’s he supposed to do with that? “Two to the 100th power,” his accountant 
whispers. Then it all comes back to him. As a casino operator, he knows a little about 
calculating probabilities. He punches 2 100 into the calculator and is greeted by the figure 
1.2676506 x 10 . The figure, he suddenly realizes, is a life sentence without any realistic 
possibility of parole. It means that for any given 100-toss series, the man would have a 
paltry 1 chance in 1,267,650,600,000,000,000,000,000,000,000 of getting all heads. That 
ten-year window isn’t enough to give him a realistic chance of flipping 100 heads in a 
row. A hundred years isn’t enough. A trillion years wouldn’t be enough. The criminal 
should ignore his initial intuition and trust his accountant’s calculator. 

Dembski explained that if a prisoner flipped a coin at the reasonable rate of once every 
five seconds for eight hours a day, six days a week, for ten years he would generate only 
17,797,120 (-1.7 x 1 0 7 ) trials. 10 

Yet this number represents a minuscule portion of the total number of outcomes possible 
(roughly 10 7 trials out of 10 30 total possibilities). Even taking the number of trials into 
account, the odds of generating 100 heads in a row in ten years increases to only roughly 
1 chance in 10 23 (i.e., 10 7 /10 3 °), or 1 in 100,000,000,000,000,000,000,000. Yet this 
number is obviously far smaller than 1 in 2 or the 50 percent probability that we usually 
use to gauge a “reasonable chance of success.” Thus, it is vastly more likely than not that 
the criminal will not generate the necessary sequence of 100 heads in a row by chance 
within ten years. He doesn’t have the necessary “probabilistic resources.” 11 

My discussions with Dembski revealed that there are situations in which a chance 
hypothesis can be reasonably rejected. Sometimes there are sufficient probabilistic 
resources to make an unlikely event probable (e.g., two red 16s in a row at the roulette 
table over the course of a week). And sometimes there are not (as illustrated by 
Dembski’s coin-flipping convict). Scientists can never absolutely prove that some 
astronomically improbable event could not have taken place by chance. But they can 
detennine when it is more likely than not — even vastly more likely than not — that such 
an event won’t or didn’t happen by chance alone. 

To see this, turn the case of the coin-flipping criminal around. Imagine the convict arrives 
in prison and, having accepted the judge’s challenge, begins to flip a quarter in front of a 
surveillance camera as required by the court to verify any winning result. After two days, 
the criminal calls for a prison guard and asks to have the tape reviewed. After viewing the 
tape of the criminal’s last 100 flips, the guard and the warden of the prison verify that, 
yes indeed, the prisoner flipped 100 heads in a row. What should the court conclude 
about the prisoner’s unexpected success? That the prisoner got lucky on one of his first 
attempts? This is, of course, a possible, though incredibly improbable, outcome on the 
chance hypothesis. But because the court sees a pattern (or functional significance) in the 
event, and because it knows something about the odds of this event and the prisoner’s 
probabilistic resources — that the prisoner had few opportunities to beat the odds — the 




court suspects that the prisoner cheated. So they investigate before granting release. Sure 
enough, they find that the prisoner snuck a biased quarter into his cell. 



Tools for Further Investigation 

So what about the origin of life? Could the chance hypothesis explain the origin of the 
specified infonnation necessary to produce life in the first place? 

As I reflected on Fisher’s method of statistical hypothesis testing and Dembski’s insights 
about pattern recognition and probabilistic resources, I realized that I had some analytical 
tools with which to address the specific questions that interested me. My investigation of 
how chance hypotheses are generally evaluated suggested a specific line of inquiry and 
raised several questions that I would need to answer. Is the chance hypothesis for the 
origin of biological infonnation a substantive hypothesis or merely a vacuous cover for 
ignorance? If substantive, are there reasons to affirm the adequacy of the chance 
hypothesis? Or are there grounds for eliminating chance from consideration, at least as 
the best explanation? 

I also wondered: Is there any pattern in the sequence of bases in DNA of the kind that 
routinely leads us to doubt chance hypotheses in other situations? If so, how improbable 
is the origin of the information necessary to produce the first life? And finally, what are 
the relevant probabilistic resources for generating the information in DNA, RNA, and 
proteins, and are they sufficient to render these presumably improbable events probable? 
The next two chapters describe what I discovered as I addressed these questions. 
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Ends and Odds 



As the result of my work with Bill Dembski in the summer of 1992, I now had a grid by 
which to evaluate appeals to chance. After I returned from Cambridge that summer to 
resume my teaching duties at Whitworth College, I began to reexamine specific proposals 
for how the information necessary for the origin of life might have arisen by chance. 
Nevertheless, as an assistant professor with a heavy teaching load and a father of young 
children, I didn’t make as much progress on my research over the next few years as I had 
optimistically projected during my heady summer abroad. That might have turned out for 
the best, however. I didn’t realize it at the time, but I would be in a much better position 
to evaluate chance as an explanation for the origin of life after Dembski and another 
scientist I met in Cambridge that summer had completed their own pieces of the research 
puzzle. 

Even so, I began to evaluate the chance hypothesis as best I could. As I did, I 
rediscovered something that had surprised me when I first encountered it during my 
graduate studies in England. Most of the leading scientists investigating the origin of 
life — none sympathetic to the design hypothesis — were deeply skeptical of chance alone. 
Moreover, I now realized that though these scientists rarely articulated explicit criteria for 
rejecting the chance hypothesis, they intuitively presupposed criteria that closely 
resembled those expressed in Fisher’s method of chance elimination, or Dembski’s, or 
both. Some also questioned the legitimacy of “chance” because they suspected that it 
functioned only as a cover for ignorance. 

This skepticism had been, at first, surprising to me, because informal discussions of the 
origin of life and even some college-level textbooks often gave the impression that 
“chance” was the major factor in the origin of life. For example, in a memorable passage 
of his popular college textbook, Biochemistry, Albert L. Lehninger described how 
inanimate matter crossed the great divide to become alive. “We now come to the critical 
moment in evolution,” he wrote in 1970, “in which the first semblance of ‘life’ appeared, 
through the chance association of a number of abiotically formed macromolecular 
components.” 1 Earlier, in 1954, biochemist George Wald argued for the causal efficacy 
of chance in conjunction with vast expanses of time. As he explained, “Time is in fact the 
hero of the plot.... Given so much time, the impossible becomes possible, the possible 
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probable, and the probable virtually certain.” Similarly, in 1968, Francis Crick suggested 

'J 

that the origin of the genetic code might be a “frozen accident.” 

Most chance hypotheses assumed, sensibly enough, that life could not originate without 
biological information first arising in some fonn. In extant cells, DNA stores the 



information for producing the proteins that perform most critical functions. It follows 
then that scientists must explain either (a) where the information in DNA (or perhaps 
some other equivalent information-bearing molecule) came from or (b) how proteins 
might have arisen directly without DNA (or some equivalent source). Chance-based 
models have taken both these approaches. In some scenarios, scientists proposed that 
DNA first arose by chance and then later came into a functional association with the 
protein molecules that are needed to transcribe and translate genetic information into 
proteins. In other scenarios, scientists proposed that protein molecules arose first by 
chance and then later came into association with DNA, RNA, and the other molecules 
that are now part of the modern system of gene expression. More recently, a third 
approach suggests that infonnation first arose in RNA molecules and that RNA then 
functioned initially both as a carrier of genetic information (as DNA does now) and as an 
enzymatic catalyst (as some proteins do now), although in this model chance is often 
coupled with other mechanisms (see Chapter 14). 

In nearly all these scenarios, origin-of-life theorists first envisioned some favorable 
prebiotic environment rich in the building blocks out of which DNA, RNA, and proteins 
are made. Following Oparin’s early theory proposing that these chemical building blocks 
collected in the earth’s early ocean, this hypothesized environment acquired the 
whimsical name “prebiotic soup.” Scientists typically imagined subunits of the DNA, 
RNA, and protein molecules floating freely in this soup or in some other favorable 
environment. The prebiotic soup would not only provide a rich source of the necessary 
chemical constituents for building proteins and nucleic acids, but it would also afford 
many opportunities for these building blocks to combine and recombine to form larger 
biologically relevant molecules. 

Thus, chance theories for the origin of biological information typically envision a process 
of chemical shuffling. This process eventually would produce large specifically 
sequenced biopolymers (DNA, RNA, and proteins) by chance starting from an 
assortment of smaller molecules (such as amino acids in the case of proteins, or bases, 
sugars, and phosphates in the case of DNA and RNA). Some have likened this process to 
a “cosmic jackpot.” Just as a series of lucky bets might win a gambler a lot of money, so 
too, perhaps, did a series of fortuitous collisions and connections between chemical 
building blocks produce the first functional proteins or infonnation-rich nucleic acids. 

But does chance provide an adequate explanation for the origin of biological 
information? Or does it merely conceal ignorance of what actually happened? Do 
chance-based theories cite processes that are known to generate infonnation-rich 
structures with some known or regular frequency? Or are there good reasons to reject 
chance as the best explanation for the origin of the information necessary to life? 

These were the questions that animated my investigation of the chance hypothesis in the 
years that followed my initial conversations with Bill Dembski. I knew that many 
origin-of-life research scientists were deeply skeptical about explanations that invoked 




chance alone to explain the origin of life. As I investigated this question, I became 
convinced that this skepticism was justified — and for several significant reasons. 



Functionally Significant Patterns 

First, the bases in DNA and RNA and the sequence of amino acids in proteins do not 
contain mere Shannon infonnation. Rather, these molecules store infonnation that is also 
functionally specified. As such, they manifest one of the kinds of patterns — a functionally 
significant pattern — that routinely lead statisticians to reject chance hypotheses, at least in 
the case of extremely improbable events. 

To see why the presence of specified information in DNA poses a difficulty for the 
efficacy of chance processes, recall again our hapless bureaucrat Mr. Jones. When Mr. 
Jones arrived at the airport in Washington, D.C., having lost Mr. Smith’s phone number, 
he did not despair. He knew that by randomly arranging the numbers 0, 1,2, 3, 4, 5, 6, 7, 
8, and 9 he could produce a large amount of Shannon information. Of course, he quickly 
discovered that he did not need just unspecified information; he needed specified 
information. Being naive about the difference, however, Jones thought that he could 
arrange and rearrange the ten digits at random to solve the problem by making other 
numbers. And so he did. He tried 258-197-7292. He tried 414-883-6545. He tried 
279-876-2982 and 867-415-5170 and 736-842-3301. He tried hundreds of numbers. In 
nearly every case, he succeeded only in getting a recorded message from the phone 
company. Twice a person answered only to tell him that he had a wrong number. Jones 
kept trying, but he eventually exhausted himself. He simply did not have enough time to 
have a reasonable chance of finding the correct (i.e., specified) phone number by chance 
alone. 

Had Jones been more perceptive, he would have learned an important lesson. Arranging 
and rearranging characters at random — by chance — can produce lots of unspecified 
information. It is not, however, an effective way to produce specified information, 
especially if the time available in which to do so is limited. 

During my first few years teaching, I developed another illustration to help students see 
why many origin-of-life researchers had come to reject the idea that the specified 
information necessary to the first life could have arisen by chance alone. I would take a 
bag of Scrabble letters and walk through the aisles of my classroom asking each student 
to pull out one letter at random from the bag and then walk to the chalkboard and copy 
that letter on the board. When all had done so, I pointed out that what we had produced 
was a sequence rich in Shannon infonnation, but no specified information. The sequence 
produced at random was complete gibberish. I would then challenge the class to produce 
a meaningful sentence, any meaningful sentence, using this random method. 

Occasionally, as if to resist the point of my demonstration, the students would persist in 
their efforts until they had produced a very short sequence of letters that showed some 
potential for spelling something meaningful, like “pie” or “nes” or even a short word 




such as “ran.” On these occasions, my students would begin to hoot as if they had shown 
me up — which was exactly what I was hoping they would do. When this happened, I 
would stop and quickly calculate the amount of specified infonnation they had produced. 
I would then compare that amount to the amount of information in a short sentence (or 
functional gene) and ask them to continue selecting letters at random until they had 
produced an equivalent amount of specified information — a complete sentence or even a 
long word or two. Invariably the same thing would happen. 

After several more selections, whatever promise the students had detected in their initial 
string of characters was quickly diluted by more gibberish. After their failure became 
apparent, I would challenge them to start over. No matter how many times they did, they 
could not produce a long word, let alone a meaningful sentence of any considerable 
length. By that point, my students saw the moral of the story: a random search might 
produce a small amount of specified infonnation, but not very much, especially, again, if 
the time available to search the possible combinations was limited. 

Many origin-of-life scientists have similarly recognized how difficult it is to generate 
specified biological infonnation by chance alone in the time available on the early earth 
(or even in the time available since the beginning of the universe). As one leading 
biochemist, Alexander Cairns-Smith, wrote in 1971: “Blind chance... is very limited. 
Low levels of cooperation he [blind chance] can produce exceedingly easily (the 
equivalent of letters and small words), but he becomes very quickly incompetent as the 
amount of organization [information] increases. Very soon indeed long waiting periods 
and massive material resources become irrelevant.” 4 

In saying things like this, scientists have recognized that the crucial problem is not just 
generating an improbable sequence of chemical constituents (e.g., an improbable 
arrangement of nucleotide bases in DNA). Scientists understand that improbable events 
happen by chance. Instead, the problem is relying on a random search or shuffling to 
generate one of the rare arrangements of molecules that also performs a biological 
function. It’s like the difference between shooting an arrow blindfolded and having it land 
anywhere and shooting an arrow blindfolded and having it hit the target. It’s much harder 
to do the latter than the former. 

Christian de Duve has devised an illustration that underscores the importance of pattern 
recognition (and the recognition of functional specificity) in our reasoning about chance 
hypotheses. He points out: “A single, freak, highly improbable event can conceivably 
happen. Many highly improbable events — drawing a winning lottery number or the 
distribution of playing cards in a hand of bridge — happen all the time. But a string of 
[such] improbable events — drawing the same lottery number twice, or the same bridge 
hand twice in a row — does not happen naturally.” 5 

Here de Duve suggests, like Fisher and Dembski, that an improbable event or set of 
conditions does not alone provide grounds for rejecting a chance hypothesis. Instead, his 
illustration indicates that pattern recognition also plays a key role. As the same person 




repeatedly wins the lottery or as the same hand of bridge recurs, a pattern 
emerges — indeed, one that I have been calling “a functionally significant pattern.” In de 
Duve’s examples, an improbable sequence of events has occurred that also produces a 
significant objective — winning a game or winning money. De Duve suggests that when 
we recognize that a specific sequence of events has produced a significant outcome — one 
that achieves a goal or performs a function — we rightly suspect something more than 
chance at work. 

For this reason, he suggests that we should doubt the chance hypothesis for the origin of 
biological infonnation. According to the chance hypothesis, as the molecular building 
blocks of DNA, RNA, and proteins interact in a prebiotic environment, these molecular 
subunits would have attached to one another to form long chainlike molecules. Amino 
acids would have attached to other amino acids to form a growing polypeptide chain. 
Sugar and phosphate molecules would have attached to each other. Nucleotide bases 
would have attached to sugar-phosphate backbones in various arrangements. Thus, the 
advocates of the chance hypothesis envision such chains growing molecule by molecule 
(or “letter by letter”). 

But to form functional genes and proteins, not just any arrangement of bases or amino 
acids will do. Not by any means. The overwhelming majority of arrangements of bases 
and amino acids perform no biological function at all. So the assembly of working 
proteins, for example, in the prebiotic soup would have required functionally appropriate 
amino acids to attach themselves by chance to a growing chain of other amino 
acids — time after time after time — despite many opportunities for other less fortuitous 
outcomes to have occurred. De Duve seems to liken this process to a gambler repeatedly 
drawing the winning card or the high hand. In both cases he suggests that an astute 
observer should detect a pattern — a pattern of events that produces a significant 
outcome . 6 

De Duve insists that as unlikely patterns of fortuitous events emerge, our doubt about the 
chance hypothesis should increase. Just as we doubt that a card player could draw the 
same improbable winning hand time after time unless he is cheating, so too should we 
doubt that appropriate amino acids or bases would happen by chance to attach themselves 
repeatedly to a growing protein or DNA molecule. In other words, we should be skeptical 
that the random interaction of molecular building blocks will produce functionally 
specified sequences in DNA (or proteins) by chance alone. 

This makes sense. We kn ow from experience that random searches are incredibly 
inefficient means of generating specified sequences of characters or events — as my 
Scrabble-bag challenge demonstrated to my students. Thus, when we encounter 
functionally specified sequences of characters (or molecules that function 
equivalently) — or when we encounter a functionally significant pattern in a sequence of 
events (as we do in de Duve’s gambling examples) — we are correctly skeptical that 
chance alone is at work. DNA contains specified sequences of chemicals that function 
just like digital or alphabetic characters: it contains functionally specified infonnation, 




not just Shannon infonnation. As scientists such as de Duve have recognized this, they 
understandably have become more skeptical about chance as an explanation for the origin 
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of biological information. 

The Improbability of Producing Genes and Proteins by Chance 

There is a second reason for doubting the chance hypothesis as an explanation for the 
origin of biological infonnation. Building a living cell not only requires specified 
information; it requires a vast amount of it — and the probability of this amount of 
specified information arising by chance is “vanishingly small.” But how small? What 
exactly is the probability that the infonnation necessary to build the first living cell would 
arise by chance alone? 

It turns out that it was initially hard to quantify the answer to this question, because 
biologists didn’t know exactly how much information was necessary to build and 
maintain the simplest living cell. For one thing, they didn’t know how many of the parts 
of existing cells were necessary to maintain life and how many weren’t. But beginning in 
the 1990s, scientists began to do “minimal complexity” experiments in which they tried 
to reduce cellular function to its simplest form. Since then, biologists have been able to 
make increasingly informed estimates of the minimum number of proteins and genes that 
a hypothetical protocell might have needed to survive. 

The simplest extant cell, Mycoplasma genitalium — a tiny bacterium that inhabits the 
human urinary tract — requires “only” 482 proteins to perform its necessary functions and 
562,000 bases of DNA (just under 1,200 base pairs per gene) to assemble those proteins. 
In minimal-complexity experiments scientists attempt to locate unnecessary genes or 
proteins in such simple life-forms. Scientists use various experimental techniques to 
“knock out” certain genes and then examine the effect on the bacterial cell to see if it can 
survive without the protein products of the disabled genes. Based upon 
minimal-complexity experiments, some scientists speculate (but have not demonstrated) 
that a simple one-celled organism might have been able to survive with as few as 
250-400 genes. 

Of course building a functioning cell — at least one that in some way resembles the cells 
we actually observe today — would have required more than just the genetic information 
that directs protein synthesis. It would have also required, at the very least, a suite of 
preexisting proteins and RNA molecules — polymerases, transfer RNAs, ribosomal 
RNAs, synthetases, and ribosomal proteins, for example — to process and express the 
information stored in DNA. In fact, there are over 100 specific proteins involved in a 
simple bacterial translation system; roughly 20 more are involved in transcription and 
over 30 in DNA replication. 8 Indeed, although the infonnation in DNA is necessary for 
building proteins, many proteins are necessary for expressing and processing the 
information in DNA (see Chapter 5). Extant cells, therefore, need both types of 
information-rich molecules — nucleic acids and proteins — functioning together in a tightly 
integrated way. Therefore, any minimally complex protocell resembling cells we have 




today would have required not only genetic information, but a sizable preexisting suite of 
proteins for processing that infonnation. 

Building such a cell also would have required other preexisting components. For 
example, both proteins and DNA are necessary for building and maintaining the 
energy-production and-processing system of the cell, the ten-enzyme glycolysis pathway 
that produces high-energy ATP from its low-energy precursor ADP. But 
information-processing and protein synthesis — and just about everything else in the 
cell — depends upon a preexisting supply of ATP or one of the closely related 
energy-carrying molecules. Thus, for life to arise in the first place, ATP (or related 
molecules) must have also been present along with genes and proteins. 

Beyond that, the first cell would have required some kind of semipermeable membrane 
and a cell wall to protect itself and the chemical reactions taking place inside it. In 
modem bacteria, the protective barrier is constructed by proteins and enzymes that 
polymerize (link together) the smaller molecules out of which both the phospholipid 
bilayer and the cell wall are composed. Cells require a variety of these small 
molecules — phosphates, lipids, sugars, vitamins, metals, ATP — in addition to proteins 
and nucleic acids. Many of these small molecules are synthesized with the help of 
proteins in the cell, but proteins also need these molecules in order to accomplish many 
of their enzymatic functions. Thus, any protocell likely would have required a preexisting 
supply of these smaller molecules to establish and maintain itself as well. 

The integrated complexity of even a “minimally complex cell” has made it difficult to 
calculate the odds of all the necessary components of such a system arising in close 
association with one another by chance alone. Nevertheless, as a first-order 
approximation, many theorists have contented themselves with calculating the probability 
of producing just the information necessary to build the suite of necessary proteins, while 
recognizing that many more components would also be required and that the probability 
thus calculated vastly underestimates the difficulty of the task at hand. 

To calculate this probability, scientists typically use a slightly indirect method. First they 
calculate the probability of a single functional protein of average length arising by chance 
alone. Then they multiply that probability by the probability of each of the other 
necessary proteins arising by chance. The product of these probabilities detennines the 
probability that all the proteins necessary to service a minimally complex cell would 
come together by chance. 

Scientists could, of course, analyze DNA to calculate the probability of the corresponding 
genetic information arising by chance. But since the information in each gene directs the 
synthesis of a particular protein, the two sequences (the gene sequence and its 
corresponding protein sequence) carry the same amount of information . 9 Proteins 
represent, in the parlance of the discipline, the “expressed function” of the coding region 
of a DNA molecule. Further, since measures of probability and infonnation are related 
(albeit, inversely), the probability that a particular gene would arise by chance is roughly 




the same as the probability that its corresponding gene product (the protein that the gene 
encodes) would do so. 

For that reason, the relevant probability calculation can be made either by analyzing the 
odds of arranging amino acids into a functional protein or by analyzing the odds of 
arranging nucleotide bases into a gene that encodes that protein. Because it turns out to be 
simpler to make the calculation using proteins, that’s what most origin-of-life scientists 
have done. 10 In any case by the 1960s, as scientists began to appreciate the complexity of 
both DNA and proteins and the amount of specified information they contain, many 
began to suspect that producing these molecules by chance alone would prove to be quite 
difficult. 



Symposium at the Wistar Institute 

The first public inkling of concern about the chance explanation surfaced at a now 
famous conference of mathematicians and biologists held in Philadelphia in 1966. The 
conference was titled “Mathematical Challenges to Neo-Darwinism.” 1 1 It was held at the 
Wistar Institute, a prestigious medical research center, and chaired by Sir Peter Medawar, 
a Nobel laureate from England. The conference was called to discuss the growing doubts 
of many mathematicians, physicists, and engineers about the ability of random mutations 
to generate the information needed to produce new forms of life. Though the skeptics 
mainly expressed doubt about the role of random mutations in biological evolution, the 
questions they raised had equally important implications for assessing the role of chance 
in chemical evolutionary theories about the first life. 

These doubts were first discussed at an infonnal private gathering in Geneva in the 
mid-1960s at the home of MIT physicist Victor Weisskopf. During a picnic lunch the 
discussion turned to evolution. Several of the MIT math, physics, and engineering 
professors present expressed surprise at the biologists’ confidence in the power of 
mutations to produce new forms of life in the time available to the evolutionary process. 
A vigorous argument ensued, but was not resolved. Instead, plans were made for a 
conference to discuss the concerns of the skeptics. 

According to some of these MIT professors, the neo-Darwinian mechanism faced what 
they called a “combinatorial problem.” In mathematics, the term “combinatorial” refers 
to the number of possible ways that a set of objects can be arranged or combined (the 
relevant branch of mathematics is known as “combinatorics”). Some simple bike locks, 
for example, have four dials with ten settings on each dial. A bike thief encountering one 
of these locks (and lacking bolt cutters) faces a combinatorial problem because there are 
10x 10x 10x 10, or 10,000 possible ways of combining the possible settings on each of 
the four dials and only one combination that will open the lock. Guessing at random is 
unlikely to yield the correct combination unless the thief has a truly generous amount of 
time to do so. 




The Wistar scientists explained that a similar difficulty confronts the Darwinian 
mechanism. According to neo-Darwinian theory, new genetic information arises first as 
random mutations occur in the DNA of existing organisms. When mutations arise that 
confer a survival advantage on the organisms that possess them, the resulting genetic 
changes are passed on by natural selection to the next generation. As these changes 
accumulate, the features of a population begin to change over time. Nevertheless, natural 
selection can “select” only what random mutations first produce. And for the 
evolutionary process to produce new forms of life, random mutations must first have 
produced new genetic infonnation for building novel proteins. That, for the 
mathematicians, physicists, and engineers at Wistar, was the problem. Why? 

The skeptics at Wistar argued that it is extremely difficult to assemble a new gene or 
protein by chance because of the sheer number of possible base or amino-acid sequences. 
For every combination of amino acids that produces a functional protein there exists a 
vast number of other possible combinations that do not. And as the length of the required 
protein grows, the number of possible amino-acid sequence combinations of that length 
grows exponentially, so that the odds of finding a functional sequence — that is, a working 
protein — diminish precipitously. 

To see this, consider the following. Whereas there are four ways to combine the letters A 
and B to make a two-letter combination (AB, BA, AA, and BB), there are eight ways to 
make three-letter combinations (AAA, AAB, ABB, ABA, BAA, BBA, BAB, BBB), and 
sixteen ways to make four-letter combinations, and so on. The number of combinations 
grows geometrically, 2,2,2, and so on. And this growth becomes more pronounced 
when the set of letters is larger. For protein chains, there are 20", or 400, ways to make a 
two-amino-acid combination, since each position could be any one of 20 different 
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alphabetic characters. Similarly, there are 20 , or 8,000, ways to make a three-amino-acid 
sequence, and 20 4 , or 160,000, ways to make a sequence four amino acids long, and so 
on. As the number of possible combinations rises, the odds of finding a correct sequence 
diminishes correspondingly. But most functional proteins are made of hundreds of amino 
acids. Therefore, even a relatively short protein of, say, 150 amino acids represents one 
sequence among an astronomically large number of other possible sequence 
combinations (approximately 10 195 ). 

Consider the way this combinatorial problem might play itself out in the case of proteins 
in a hypothetical prebiotic soup. To construct even one short protein molecule of 150 
amino acids by chance within the prebiotic soup there are several combinatorial 
problems — probabilistic hurdles — to overcome. First, all amino acids must fonn a 
chemical bond known as a peptide bond when joining with other amino acids in the 
protein chain (see Fig. 9.1). If the amino acids do not link up with one another via a 
peptide bond, the resulting molecule will not fold into a protein. In nature many other 
types of chemical bonds are possible between amino acids. In fact, when amino-acid 
mixtures are allowed to react in a test tube, they fonn peptide and nonpeptide bonds with 
roughly equal probability. Thus, with each amino-acid addition, the probability of it 
forming a peptide bond is roughly 1/2. Once four amino acids have become linked, the 




likelihood that they are joined exclusively by peptide bonds is roughly 1/2 x 1/2 x 1/2 x 
1/2 = 1/16, or (1/2) 4 . The probability of building a chain of 150 amino acids in which all 
linkages are peptide linkages is (1/2) 149 , or roughly 1 chance in 10 45 . 

Second, in nature every amino acid found in proteins (with one exception) has a distinct 
mirror image of itself; there is one left-handed version, or L-form, and one right-handed 
version, or D-form. These mirror-image forms are called optical isomers (see Fig. 9.2). 
Functioning proteins tolerate only left-handed amino acids, yet in abiotic amino-acid 
production the right-handed and left-handed isomers are produced with roughly equal 
frequency. Taking this into consideration further compounds the improbability of 
attaining a biologically functioning protein. The probability of attaining, at random, only 
L-amino acids in a hypothetical peptide chain 150 amino acids long is (1/2) 150 , or again 
roughly 1 chance in 10 45 . Starting from mixtures of D-forms and L-forms, the probability 
of building a 150-amino-acid chain at random in which all bonds are peptide bonds and 
ah amino acids are L-form is, therefore, roughly 1 chance in 10 9 °. 



4 4 



Q 

r* 


4 4 c. 

^ L / 

* L v 


4 


\ 




04 


4 4 £>4 


4 



4 o 

i * 

to-L-L 
* \ ^ 



4 4 

04 4 4 04 



l 




4 4 o 4 4 o 4 4 g 4-1 W o 

\ I 4 'v 1 4 \ ^ \ \ * 

— Ki -t t- W~C-C 

s r r i / i S l \ 

4 fe 4 t 4 fe 



Figure 9.1. Amino acids link together when the amino group of one amino acid bonds to 
the carboxyl group of another. Notice that water is a by-product of the reaction (called a 
condensation reaction). 



Functioning proteins have a third independent requirement, the most important of all: 
their amino acids, like letters in a meaningful sentence, must link up in functionally 
specified sequential arrangements. In some cases, changing even one amino acid at a 
given site results in the loss of protein function. Moreover, because there are 20 
biologically occurring amino acids, the probability of getting a specific amino acid at a 
given site is small — 1/20. (Actually the probability is even lower because, in nature, there 
are also many nonprotein-forming amino acids.) On the assumption that each site in a 
protein chain requires a particular amino acid, the probability of attaining a particular 
protein 150 amino acids long would be ( 1/20) 15 °, or roughly 1 chance in 10 . 



Nevertheless, molecular biologists have known for a while that most sites along the chain 
can tolerate several of the different twenty amino acids commonly found in proteins 
without destroying the function of the protein, though some cannot. At the Wistar 
conference this tolerance was cited by some of the evolutionary biologists as evidence 
that random mutations might be sufficient to produce new functional proteins. Most of 
the mathematicians at the conference had assumed that the functional sequences were 
incredibly rare within the space of all the possible combinations. But some of the 
evolutionary biologists asked: What if functional sequences are more common than the 
mathematicians are supposing? This raised an important question, one that could not be 
answered in 1966: How rare, or common, are the functional sequences of amino acids 
among all the possible sequences of amino acids in a chain of any given length? 
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Figure 9.2. Two optical isomers of the same amino acid. 

In the late 1980s, several important studies were conducted in the laboratory of MIT 
biochemist Robert Sauer in order to investigate this question. His research team used a 
sampling technique known as “cassette mutagenesis” to determine how much variance 
among amino acids can be tolerated at any given site in several proteins. This technique 
would help resolve an important issue. If proteins can tolerate a lot of variance, then that 
would increase the probability that a random search through the space of possibilities 
would find a functional sequence. If proteins were more finicky — if the requirements of 
functionality imposed more rigid or restricted constraints on sequencing — then that 
would decrease the probability of a random process successfully producing a functional 
protein. Thus, whatever Sauer’s team discovered would be extremely significant. The 
results of their experiments could help determine the probability that a functional protein 
would arise by chance from a prebiotic soup. 

So what did they find? Their most clear-cut experiments seemed to indicate that, even 
taking the possibility of variance into account, the probability of achieving a functional 
sequence of amino acids in several known (roughly 100 amino acid) proteins at random is 
still “exceedingly small,” about 1 chance in 10 (to put this in perspective, there are 10 
atoms in our galaxy). Using a variety of mutagenesis techniques, they and other 
scientists showed that proteins (and thus the genes that produce them) are highly 
specified relative to biological function. 14 Earlier studies indicated that amino-acid 
residues at many sites cannot vary without functional loss. 15 Now Sauer and others 
demonstrated that even for sites that do admit some variance, not just any amino acid will 



do. Instead, they showed that functional requirements place significant constraints on 
sequencing at sites where some variance is allowed. By quantifying that allowable 
variance, they made it possible to calculate the probability of finding a protein with a 
functional sequence among the larger ensemble of combinatorial possibilities. 

I first learned about the work of Robert Sauer and its relevance to assessing the chance 
hypothesis in 1992 from a postdoctoral researcher at Cambridge University named 
Douglas Axe. Axe and I met in the summer of that year as I was conducting research with 
Bill Dembski. Axe had come to Cambridge to perform mutagenesis experiments that 
were similar to Sauer’s. During his Ph.D. work at Caltech, Axe learned about the 
structure of proteins and the intricate folds they need in order to perform their functions. 
He began to wonder how difficult it was to produce these folds by random mutations or 
random molecular interactions. He began to ask a very similar question to the one that 
had been asked at the Wistar conference: How rare, or common, are the amino-acid 
sequences that produce the stable folds that make it possible for proteins to perform their 
biological functions? 

As Axe began to examine Sauer’s experimental method, he asked whether Sauer might 
have underestimated how much protein sequences can vary and still maintain function. 
To test this possibility, he developed a more rigorous method of estimating this allowable 
variability in order to eliminate possible estimation error. Axe liked what Sauer had done, 
but wanted to produce a more definitive answer. He wanted to settle the key question 
from the Wistar conference beyond any reasonable doubt. His interest in the subject 
eventually led him to the Cambridge University laboratory of Alan Fersht, and then to the 
Center for Protein Engineering at the famous Medical Research Council Centre in 
Cambridge, where the likes of Francis Crick, Max Perutz, John Kendrew, and Fred 
Sanger had worked in the early 1960s. Axe’s work was published in a series of papers in 
the Journal of Molecular Biology, Biochemistry, and the Proceedings of the National 
Academy of Sciences between 1996 and 2004. 





Figure 9.3. Douglas Axe. Printed by permission from Douglas Axe. 



The results of a paper he published in 2004 were particularly telling. 16 Axe performed a 
mutagenesis experiment using his refined method on a functionally significant 
150-amino-acid section of a protein called beta-lactamase, an enzyme that confers 
antibiotic resistance upon bacteria. On the basis of his experiments, Axe was able to 
make a careful estimate of the ratio of (a) the number of 150-amino-acid sequences that 
can perform that particular function to (b) the whole set of possible amino-acid sequences 
of this length. Axe estimated this ratio to be 1 to 10 . 

This was a staggering number, and it suggested that a random process would have great 
difficulty generating a protein with that particular function by chance. But I didn’t want 
to know just the likelihood of finding a protein with a particular function within a space 
of combinatorial possibilities. I wanted to kn ow the odds of finding any functional 
protein whatsoever within such a space. That number would make it possible to evaluate 
chance-based origin-of-life scenarios, to assess the probability that a single protein — any 
working protein — would have arisen by chance on the early earth. 

Fortunately, Axe’s work provided this number as well. Axe knew that in nature 
proteins perform many specific functions. He also knew that in order to perform these 
functions their amino-acid chains must first fold into stable three-dimensional structures. 
Thus, before he estimated the frequency of sequences perfonning a specific 
(beta-lactamase) function, he first performed experiments that enabled him to estimate 
the frequency of sequences that will produce stable folds. On the basis of his 
experimental results, he calculated the ratio of (a) the number of 150-amino-acid 
sequences capable of folding into stable “function-ready” structures to (b) the whole set 
of possible amino-acid sequences of that length. He determined that ratio to be 1 to 10 74 . 




Since proteins can’t perform functions unless they first fold into stable structures, Axe’s 
measure of the frequency of folded sequences within sequence space also provided a 
measure of the frequency of functional proteins — any functional proteins — within that 
space of possibilities. Indeed, by taking what he knew about protein folding into account, 
Axe estimated the ratio of (a) the number of 150-amino-acid sequences that produce any 
functional protein whatsoever to (b) the whole set of possible amino-acid sequences of 
that length. Axe’s estimated ratio of 1 to 10 74 implied that the probability of producing 
any properly sequenced 150-amino-acid protein at random is also about 1 in 10 74 . In 
other words, a random process producing amino-acid chains of this length would stumble 
onto a functional protein only about once in every 10 74 attempts. 

When one considers that Robert Sauer was working on a shorter protein of 100 amino 
acids, Axe’s number might seem a bit less prohibitively improbable. Nevertheless, it still 
represents a startlingly small probability. In conversations with me, Axe has compared 
the odds of producing a functional protein sequence of modest (150-amino-acid) length at 
random to the odds of finding a single marked atom out of all the atoms in our galaxy via 
a blind and undirected search. Believe it or not, the odds of finding the marked atom in 
our galaxy are markedly better (about a billion times better) than those of finding a 
functional protein among all the sequences of corresponding length. 

This was a very significant result. Building on the work of Robert Sauer and others, Axe 
established a reliable quantitative estimate of the rarity of functional sequences within the 
corresponding space of possible amino-acid sequences, thereby providing an answer to 
the question first posed at the Wistar symposium. Though proteins tolerate a range of 
possible amino acids at some sites, functional proteins are still extremely rare within the 
whole set of possible amino-acid sequences. 

In June 2007, Axe had a chance to present his findings at a symposium commemorating 
the publication of the proceedings from the original Wistar symposium forty years 
earlier. In attendance at this symposium in Boston was retired MIT engineering professor 
Murray Eden, with a still incisive mind at the age of eighty-seven. Eden had been one of 
the original conveners of the Wistar conference and was the one who had most forcefully 
explained the combinatorial problem facing neo-Darwinism. Forty years later, Axe’s 
experimental work had now confirmed Eden’s initial intuition: the odds are prohibitively 
stacked against a random process producing functional proteins. Functional proteins are 
exceedingly rare among all the possible combinations of amino acids. 

Axe’s improved estimate of how rare functional proteins are within “sequence space” has 
now made it possible to calculate the probability that a 150-amino-acid compound 
assembled by random interactions in a prebiotic soup would be a functional protein. This 
calculation can be made by multiplying the three independent probabilities by one 
another: the probability of incorporating only peptide bonds (1 in 10 45 ), the probability of 
incorporating only left-handed amino acids (1 in 10 45 ), and the probability of achieving 
correct amino-acid sequencing (using Axe’s 1 in 10 74 estimate). Making that calculation 
(multiplying the separate probabilities by adding their exponents: 1045 + 45 + 74) gives a 




dramatic answer. The odds of getting even one functional protein of modest length (150 
amino acids) by chance from a prebiotic soup is no better than 1 chance in 10 164 . 

It is almost impossible to convey what this number represents, but let me try. We have a 
colloquial expression in English, “That’s like looking for a needle in a haystack.” We 
understand from this expression that finding the needle will be difficult because the 
needle — the thing we want — is mixed in with a great number of other things we don’t 
want. To have a realistic chance of finding the needle, we will need to search for a long, 
long time. Now consider that there are only 10 protons, neutrons, and electrons in the 
observable universe. Thus, if the odds of finding a functional protein by chance on the 
first attempt had been 1 in 10 , we could have said that’s like finding a marked 
particle — proton, neutron, or electron (a much smaller needle) — among all the particles in 
the universe (a much larger haystack). Unfortunately, the problem is much worse than 
that. With odds standing at 1 chance in 10 164 of finding a functional protein among the 
possible 150-amino-acid compounds, the probability is 84 orders of magnitude (or 
powers of ten) smaller than the probability of finding the marked particle in the whole 
universe. Another way to say that is the probability of finding a functional protein by 
chance alone is a trillion, trillion, trillion, trillion, trillion, trillion, trillion times smaller 
than the odds of finding a single specified particle among all the particles in the universe. 

And the problem is even worse than this for at least two reasons. First, Axe’s experiments 
calculated the odds of finding a relatively short protein by chance alone. More typical 
proteins have hundreds of amino acids, and in many cases their function requires close 
association with other protein chains. For example, the typical RNA polymerase — the 
large molecular machine the cell uses to copy genetic information during transcription 
(discussed in Chapter 5) — has over 3,000 functionally specified amino acids. 18 The 
probability of producing such a protein and many other necessary proteins by chance 
would be far smaller than the odds of producing a 150-amino-acid protein. 

Second, as discussed, a minimally complex cell would require many more proteins than 
just one. Taking this into account only causes the improbability of generating the 
necessary proteins by chance — or the genetic infonnation to produce them — to balloon 
beyond comprehension. In 1983 distinguished British cosmologist Sir Fred Hoyle 
calculated the odds of producing the proteins necessary to service a simple one-celled 
organism by chance at 1 in io 40,000 . 19 At that time scientists could have questioned his 
figure. Scientists knew how long proteins were and roughly how many protein types there 
were in simple cells. But since the amount of functionally specified information in each 
protein had not yet been measured, probability calculations like Hoyle’s required some 
guesswork. 

Axe’s experimental findings suggest that Hoyle’s guesses were pretty good. If we assume 
that a minimally complex cell needs at least 250 proteins of, on average, 150 amino acids 
and that the probability of producing just one such protein is 1 in 10 164 as calculated 
above, then the probability of producing all the necessary proteins needed to service a 
minimally complex cell is 1 in 10 164 multiplied by itself 250 times, or 1 in IO 41,000 . This 




kind of number allows a great amount of quibbling about the accuracy of various 
estimates without altering the conclusion. The probability of producing the proteins 
necessary to build a minimally complex cell — or the genetic infonnation necessary to 
produce those proteins — by chance is unimaginably small. 



Conclusion 

Axe’s work confirmed the intuitions of an older generation origin-of-life researchers and 
other scientists — such as Harold Morowitz, Fred Hoyle, Chandra Wickramasinghe, 
Alexander Cairns-Smith, Ilya Prigogine, Hubert Yockey, Christian de Duve, Robert 
Shapiro, and Francis Crick (after 1980) — who were deeply skeptical about the chance 
hypothesis. Many of these scientists had performed their own calculations in which they 
assumed extremely favorable prebiotic conditions, more time than was actually available 
on the early earth, and maximally fast reaction rates between the chemical constituents of 
proteins, DNA, and RNA. Invariably, such calculations have fueled greater skepticism 
about the chance hypothesis, especially since origin-of-life researchers also recognize 
that DNA and proteins possess functionally specified, rather than just Shannon, 
information. 

But I knew from my discussions with Dembski and from my own research on the 
origin-of-life problem that there were other reasons to doubt the chance hypothesis. In the 
next chapter, I discuss why these other considerations, in conjunction with the factors 
discussed here, justify eliminating chance from consideration as the best explanation for 
the origin of the information necessary to produce the first life. 
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Beyond the Reach of Chance 



As I investigated the question of whether biological information might have arisen by 
chance, it became abundantly clear to me that the probability of the necessary events is 
exceedingly small. Nevertheless I realized, based on my previous conversations with Bill 
Dembski, that the probability of an event by itself does not alone determine whether the 
event could be reasonably explained by chance. The probabilities, as small as they were, 
were not by themselves conclusive. I remembered that I also had to consider the number 
of opportunities that the event in question might have had to occur. I had to take into 
account what Dembski called the probabilistic resources. 

But what were those resources — how many opportunities did the necessary proteins or 
genes have to arise by chance? The advocates of the chance hypothesis envisioned amino 
acids or nucleotide bases, phosphates, and sugars knocking into each other in an 
ocean-sized soup until the correct arrangements of these building blocks arose by chance 
somewhere. Surely, such an environment would have generated many opportunities for 
the assembly of functional proteins and DNA molecules. But how many? And were there 
enough such opportunities to render these otherwise exceedingly improbable events 
probable? 

Here again Bill Dembski’ s work gave me a way to answer this question. Dembski had 
calculated the maximum number of events that could actually have taken place during the 
history of the observable universe. 1 He did this to establish an upper boundary on the 
probabilistic resources that might be available to produce any event by chance. 

Dembski ’s calculation was elegantly simple and yet made a powerful point. He noted that 
there were about 10 elementary particles in the observable universe. (Because there is 
an upper limit on the speed of light, only those parts of the universe that are observable to 
us can affect events on earth. Thus, the observable universe is the only part of the 
universe with probabilistic resources relevant to explaining events on earth.) Dembski 
also noted that there had been roughly 10 16 seconds since the big bang. (A few more 
have transpired since he made the calculation, but not enough to make a difference!) 

He then introduced another parameter that enabled him to calculate the maximum number 
of opportunities that any particular event would have to take place since the origin of the 
universe. Due to the properties of gravity, matter, and electromagnetic radiation, 
physicists have determined that there is a limit to the number of physical transitions that 
can occur from one state to another within a given unit of time. According to physicists, a 
physical transition from one state to another cannot take place faster than light can 
traverse the smallest physically significant unit of distance (an indivisible “quantum” of 
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space). That unit of distance is the so-called Planck length of 10 centimeters. 
Therefore, the time it takes light to traverse this smallest distance determines the shortest 
time in which any physical effect can occur. This unit of time is the Planck time of 1 0 43 
seconds. 

Knowing this, Dembski was able to calculate the largest number of opportunities that any 
material event had to occur in the observable universe since the big bang. Physically 
speaking, an event occurs when an elementary particle does something or interacts with 
other elementary particles. But since elementary particles can interact with each other 
only so many times per second (at most 10 43 times), since there are a limited number 
(10 8 °) of elementary particles, and since there has been a limited amount of time since 
the big bang (10 16 seconds), there are a limited number of opportunities for any given 
event to occur in the entire history of the universe. 

Dembski was able to calculate this number by simply multiplying the three relevant 
factors together: the number of elementary particles (10 8 °) times the number of seconds 
since the big bang (10 16 ) times the number of possible interactions per second (10 43 ). His 
calculation fixed the total number of events that could have taken place in the observable 
universe since the origin of the universe at 10 139 . 4 This then provided a measure of the 
probabilistic resources of the entire observable universe. 

Other mathematicians and scientists have made similar calculations. 5 During the 1930s, 
the French mathematician Emile Borel made a much less conservative estimate of the 
probabilistic resources of the universe, which he set at 10 50 . 6 More recently, University 
of Pittsburgh physicist Bret Van de Sande has calculated the probabilistic resources of the 
universe at a more restrictive 2.6 x 10 . MIT computer scientist Seth Lloyd has 
calculated that the most bit operations the universe could have performed in its history 
(assuming the entire universe were given over to this single-minded task) is 10 13 , 
meaning that a specific bit operation with an improbability significantly greater than 1 
chance in 10 will likely never occur by chance. None of these probabilistic resources 
is sufficient to render the chance hypothesis plausible. Dembski’s calculation is the most 
conservative and gives chance its “best chance” to succeed. But even his calculation 
confirms the implausibility of the chance hypothesis, whether chance is invoked to 
explain the infonnation necessary to build a single protein or the infonnation necessary to 
build the suite of proteins needed to service a minimally complex cell. 

Recall that the probability of producing a single 150-amino-acid functional protein by 
chance stands at about 1 in 10 164 . Thus, for each functional sequence of 150 amino acids, 
there are at least 10 164 other possible nonfunctional sequences of the same length. 
Therefore, to have a good (i.e., better than 50-50) chance of producing a single functional 
protein of this length by chance, a random process would have to generate (or sample) 
more than one-half of the 10 164 nonfunctional sequences corresponding to each 
functional sequence of that length. Unfortunately, that number vastly exceeds the most 
optimistic estimate of the probabilistic resources of the entire universe — that is, the 
number of events that could have occurred since the beginning of its existence. 




To see this, notice again that to have a good (better than 50-50) chance of generating a 
functional protein by chance, more than half of the 10 164 sequences would have to be 
produced. Now compare that number (call it .5 x 10 164 ) to the maximum number of 
opportunities — 10 — for that event to occur in the history of the universe. Notice that 
the first number (.5 x 10 164 ) exceeds the second (10 139 ) by more than twenty-four orders 
of magnitude, by more than a trillion trillion times. 

What does this mean? It means that if every event in the universe over its entire history 
were devoted to producing combinations of amino acids of the correct length in a 
prebiotic soup (an extravagantly generous and even absurd assumption), the number of 
combinations thus produced would still represent a tiny fraction — less than 1 out of a 
trillion trillion — of the total number of events needed to have a 50 percent chance of 
generating a functional protein — any functional protein of modest length by chance 
alone. 
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In other words, even if the theoretically maximum number (10 ) of amino-acid 

sequences possible were generated, the number of candidate sequences would still 
represent a minuscule portion of the total possible number of sequences (of a given 
length). For this reason, it would be vastly more probable than not that a functional 
protein of modest length would not have arisen by chance — simply too few of the 
possible sequences would have been sampled to provide a realistic opportunity for this to 
occur. Even taking the probabilistic resources of the whole universe into account, it is 
extremely unlikely that even a single protein of that length would have arisen by chance 
on the early earth. (And, as explained in the accompanying endnote, proteins capable of 
performing many necessary features of a minimally complex cell often have to be at least 
150 amino acids in length. Moreover, there are good reasons to think that these large 
necessary proteins could not evolve from simpler proteins or peptide chains.) 9 

To see this probabilistic reasoning in everyday terms, imagine that Slick perfonns a blind 
search for a single red marble in a huge gunnysack filled with 10,000 marbles, the 
remainder of which are blue. To have a better than 50 percent chance of finding the one 
red marble, Slick must select more than 5,000 of the marbles. But Slick has only ten 
seconds in which to sample the marbles. Further, it takes one second to find and put each 
marble aside in another jar. Thus, he can hope to sample only 10 out of the 10,000 
marbles in the time available. Is it likely that Slick would find the red marble? Clearly 
not. Given his probabilistic resources, he has just 1 chance in 1,000 of finding the red 
marble, which is much less than 1 in 2, or 50 percent. 10 Thus, it is much more likely than 
not that he will not find the red marble by chance in the time available. 

In the same way, it is much more likely than not that a random process would not 
produce (or find) even one functional protein (of modest length) in the whole history of 
the universe. Given the number of possible sequences that need to be sampled and the 
number of opportunities available to do so, the odds of success are much smaller than 
1/2 — the point at which the chance hypothesis becomes reasonable (see below). Indeed, 
the odds of producing a single functional protein by chance in the whole history of the 




universe are actually much smaller than the odds of Slick finding the one red marble in 
my illustration. And beyond that, of course, the odds of producing the suite of proteins 
necessary to service a minimally complex cell by chance alone are almost unimaginably 
smaller. Indeed, the improbability of that event — calculated conservatively (see Chapter 
9) at 1 chance in 10 41 ’ 00 — completely dwarfs the probabilistic resources of the whole 
universe. Taking all those resources — 10 possible events — into account only increases 

the probability of producing a minimally complex cell by chance alone to, at best, 1 
chance in 10 4 ^’ 861 , again, an unimaginably small probability. 

And that is the third reason that the origin-of-life researchers have rejected the chance 
hypothesis. The complexity of the events that origin-of-life researchers need to explain 
exceeds the probabilistic resources of the entire universe. In other words, the universe 
itself does not possess the probabilistic resources necessary to render probable the origin 
of biological information by chance alone. 



The “Chance” of Chance 

I knew from my conversations with Bill Dembski and my study of statistical hypothesis 
testing that the occurrence of an improbable event alone does not justify rejecting a 
chance hypothesis. To justify eliminating chance, I knew that it was also necessary to 
recognize a pattern in an event and to consider the available probabilistic resources. The 
calculations presented in the previous section meet both of these conditions. DNA and 
proteins manifest functionally significant patterns. And, given all the probabilistic 
resources of the universe, the odds of producing a functional protein of modest length 
stands at less than one chance in trillion trillion. 

In making this calculation, I have computed what statisticians call a “conditional 
probability.” A conditional probability measures the probability of one thing being true 
on the assumption that another thing is true. In this case, I have calculated the probability 
of a functional protein occurring by chance given, or “conditioned on,” a best-case 
estimate of the relevant probabilistic resources. 

But this calculation has also been “conditioned” on something else. Recall that all along I 
had been attempting to determine the odds of a functional protein occurring by chance. 
That itself was another kind of conditional or “given.” I had not just been asking: What 
are the odds that a functional protein would arise? I had been asking: What are the odds 
that a functional protein or a minimally complex cell would arise by chance, given the 
available probabilistic resources? In other words, I had been asking: What are the odds 
that a functional protein or a cell would arise given the chance hypothesis (i.e., given the 
truth of the chance hypothesis)? Recall that the chance hypothesis in this case asserts that 
functional proteins or infonnation-rich DNA molecules arose from the random 
interactions of molecular building blocks in a prebiotic soup. Framing the question this 
way — as a question about the probability of the origin of proteins given the chance 
hypothesis — provided grounds for evaluating whether it was more reasonable or not to 
accept the chance hypothesis. 




This was important, because I often encounter people who think that it makes sense to 
cling to the chance hypothesis as long there was some chance — any chance, however 
small — that life might have arisen by some specified or even unspecified random process. 
They often say things like, “Sure, the origin of life is overwhelmingly improbable, but as 
long as there is at least some chance of life arising by chance, then we shouldn’t reject the 
possibility that it did.” 

This way of reasoning turns out to be fallacious, however, because it fails to recognize 
what probabilistic resources can tell us about whether it is more reasonable to accept or 
reject chance. Consider a case in which all probabilistic resources have been considered 
and the conditional probability of an event occurring given the chance hypothesis is 
greater than 1/2. That means that it is more likely than not that the event in question 
would have occurred by chance (if every opportunity for it to occur had been realized). If 
Slick is given not one minute, but, say, twelve hours (720 minutes) to sample the bag of 
marbles, it is more likely than not that he would find the red marble by sampling 
randomly. In twelve hours Slick can sample 7,200 marbles, which gives him a better than 
50 percent chance of finding the red marble. 

Conversely, if after all probabilistic resources have been considered and the conditional 
probability of an event occurring by chance is less than 1/2, then it is less likely than not 
that the event will occur by chance. In the case that such an event has already occurred 
and we have no direct knowledge of how it occurred, it makes more sense to reject the 
chance hypothesis than to accept it. 

My earlier illustrations made this point. When our hypothetical criminal claimed to have 
flipped 100 heads in a row on his second day of jail, suspicions were rightly raised. Given 
the improbability of the required outcome (producing 100 heads in a row by flipping a 
fair coin) and the limited probabilistic resources available to the prisoner, the court 
assumed — reasonably — that the prisoner had cheated, that something other than chance 
had been at work. 

Similarly, imagine that after starting my demonstration with Scrabble letters, I left the 
classroom for a few minutes and instructed my students to continue picking letters at 
random and writing the results on the board in my absence. Now imagine that upon my 
return they showed me a detailed message on the blackboard such as Einstein’s famous 
dictum: “God does not play dice with the universe.” Would it be more reasonable for me 
to suppose that they had cheated (perhaps, as a gag) or that they had gotten lucky? 
Clearly, I should suspect (strongly) that they had cheated. I should reject the chance 
hypothesis. Why? 

I should reject chance as the best explanation not only because of the improbability of the 
sequence my students had generated, but also because of what I knew about the 
probabilistic resources available to them. If I had made a prediction before I had left the 
room, I would have predicted that they could not have generated a sequence of that length 




by chance alone in the time available. But even after seeing the sequence on the board, I 
still should have rejected the chance hypothesis. 



Indeed, I should know that my students did not have anything like the probabilistic 
resources to have a realistic chance of generating a sequence of that improbability by 
chance alone. In one hour my students could not have generated anything but a minuscule 
fraction of the total possible number of 40-character sequences corresponding to the 
length of the message they had written on the board. The odds that they could have 
produced that sequence — or any meaningful sequence at all of that length — in the time 
available by choosing letters at random was exceedingly low — much less than 1/2. They 
simply did not have time to sample anything close to the number of 40-character 
sequences that they would have needed to have a 50 percent chance of generating a 
meaningful sequence of that length. Thus, it would be much more likely than not that 
they would not produce a meaningful sequence of that length by chance in the time 
available and, therefore, it was also vastly more likely than not that something other than 
chance had been in play. 



Decision Time: Assessing the Chance Hypothesis 

Following many leading origin-of-life researchers, I came to the same conclusion about 
the first life and even the first genes and proteins: it is much more likely than not that 
chance alone did not produce these phenomena. Life, of course, does exist. So do the 
information-rich biological macromolecules upon which living cells depend. But the 
probability that even one of these information-rich molecules arose by chance, let alone 
the suite of such molecules necessary to maintain or build a minimally complex cell, is so 
small as to dwarf the probabilistic resources of the entire universe. The conditional 
probability that just one of these information-rich molecules arose by chance — in effect, 
the chance that chance is true — is much less than one-half. It is less than one in a trillion 
trillion. Thus, I concluded that it is more reasonable to reject the chance hypothesis than 
to accept it. 

This was an intellectually liberating conclusion. Anyone can claim that a fantastically 
improbable event might have occurred by chance. Chance, in that sense, is always a 
possible explanation. But it doesn’t follow that chance necessarily constitutes the best 
explanation. And following what I knew about the historical scientific method, I wanted 
to find the best explanation for the origin of biological information. When I realized that I 
did not need to absolutely disprove the chance hypothesis in order to make an objective 
detennination about its merits, clarity came. By assessing the probability of an event in 
light of the available probabilistic resources, I could determine whether it was more 
reasonable to affirm or to reject the chance hypothesis for that event. When I realized that 
it was far more reasonable to reject the chance hypothesis for the origin of functional 
genes and proteins, I concluded that chance was not a terribly promising candidate for 
“best explanation” of the DNA enigma. Chance was certainly not a more reasonable 
explanation than its negation, namely, that something other than chance had been at work 
in the origin of biological infonnation. Indeed, when I remembered that the chance 




hypothesis implicitly negated both design and lawlike necessity, and that rejecting 
chance, therefore, meant affirming “something other than chance” at work, I began to 
evaluate alternative explanations. 

My own reasoning about the chance hypothesis mirrored that of many origin-of-life 
researchers, many of whom expressed exasperation at the way some scientists used 
“chance” as a catchall explanation or a cover for ignorance. 11 For example, after I first 
began reading the scientific articles about the origin of life, I found a spirited critique of 
chance published in Nature in 1963, just about the time molecular biologists were first 
coming to grips with the complexity of DNA and proteins. The paper was written by P. 
T. Mora, a senior research biologist at the National Institutes of Health. Here’s what he 
wrote: 



To invoke statistical concepts, probability and complexity to account for the 

origin and the continuance of life is not felicitous or sufficient. As the complexity 

of a molecular aggregate increases, and indeed very complex arrangements and 

interrelationships of molecules are necessary for the simplest living unit, the 

probability of its existence under the disruptive and random influence of 

physico-chemical forces decreases; the probability that it will continue to 

function in a certain way, for example, to absorb and to repair, will be even 

lower; and the probability that it will reproduce, still lower. Statistical 

considerations, probability, complexity, etc., followed to their logical 

implications suggest that the origin and continuance of life is not controlled by 

such principles. An admission of this is the use of a period of practically infinite 

time to obtain the derived result. Using such logic, however, we can prove 

anything.... When in statistical processes, the probability is so low that for 

practical purposes infinite time must elapse for the occurrence of an event, 
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statistical explanation is not helpful. 



I had come to much the same conclusion. Not only were the odds overwhelmingly 
against life arising by chance even considering all available probabilistic resources, but 
the chance hypothesis was usually invoked in a way that didn’t explain anything. To say 
that “given infinite time, life might have arisen by chance” was, in essence, a tautology. 
Given infinite time, anything might happen. But that doesn’t explain why life originated 
here or what actually caused it to do so. 



Environmental Factors 

There were some additional reasons to reject the chance hypothesis. The chance 
hypothesis for the origin of infonnation-rich biological molecules assumes the existence 
of a favorable prebiotic soup in which an abundant supply of the chemical building 
blocks of proteins and nucleic acids could interact randomly over vast expanses of time. 
These chemical building blocks were thought to have been produced by the kinds of 




chemical reactions that Stanley Miller simulated in his famous 1953 experiment. Yet 
when Stanley Miller conducted his experiment simulating the production of amino acids 
on the early earth, he had presupposed that the earth’s atmosphere was composed of a 
mixture of what chemists call reducing gases, such as methane (CH 4 ), ammonia (NH 3 ), 
and hydrogen (H 2 ). He also assumed that the earth’s atmosphere contained virtually no 

1 'J 

free oxygen. In the years following Miller’s experiment, however, new geochemical 
evidence showed that the assumptions Miller had made about the early atmosphere were 
incorrect. Instead, evidence strongly suggested that neutral gases such as carbon dioxide, 
nitrogen, and water vapor 14 — not methane, ammonia, and hydrogen — predominated in 
the early atmosphere. Moreover, a number of geochemical studies showed that significant 
amounts of free oxygen were also present even before the advent of plant life, probably 
as the result of the photo-dissociation of water vapor . 15 

This new geological and geochemical evidence implied that prebiotic atmospheric 
conditions were hostile, not friendly, to the production of amino acids and other essential 
building blocks of life. As had been well known even before Miller’s experiment, amino 
acids will form readily in a mixture of reducing gases. In a chemically neutral 
atmosphere, however, reactions among atmospheric gases will not take place readily, and 
those reactions that do take place will produce extremely low yields of biological 
building blocks . 16 Further, even a small amount of atmospheric oxygen will quench the 
production of biologically significant building blocks and cause biomolecules otherwise 
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present to degrade rapidly. 

An analogy may help to illustrate. Making amino acids in a reducing atmosphere is like 
getting vinegar and baking soda to react. Because the reaction releases stored chemical 
energy as heat, it occurs easily. (It is an example of what chemists call an “exothermic” 
reaction.) Trying to make biological building blocks in a neutral atmosphere, however, is 
more like trying to get oil and water to mix . 18 

Scientists investigating the origin of life haven’t tried to adjust their probability 
calculations in light of these developments. But they have recognized that these 
developments do complicate matters further for the chance hypothesis. To make matters 
worse, an accumulating body of geochemical evidence has shown — perhaps, not 
surprisingly, in light of the previous discussion — that there likely never was a prebiotic 
soup. Two leading geochemists, James Brooks and Gordon Shaw, argued that if an ocean 
rich in amino and nucleic acids had existed, it would have left large deposits of 
nitrogen-rich minerals (nitrogenous cokes) in metamorphosed Precambrian sedimentary 
rocks. No evidence of such deposits exists, however. In the words of Brooks: “The 
nitrogen content of early Pre-Cambrian organic matter is relatively low (less than .015%). 
From this we can be reasonably certain that: there never was any substantial amount of 
‘primitive soup’ on earth when Pre-Cambrian sediments were fonned; if such a soup ever 
existed it was only for a brief period of time .” 19 



Given my own deliberations about what constituted a substantive rather than a vacuous 
chance hypothesis, this seemed significant. A substantive chance hypothesis must invoke 




a definite process that produces the outcome in question with some regular or statistically 
predictable frequency — just as a roulette wheel produces various outcomes with a 
predictable frequency. The chance hypothesis envisioned DNA and proteins arising from 
a random process of chemical “roulette” in a favorable nitrogen-rich prebiotic ocean. If 
no such environment had ever existed, then whatever specificity the chance hypothesis 
might have once had was now lost. If there was no “chemical roulette” in which life 
would emerge as an inevitable if improbable outcome, chance could no longer be 
considered a substantive hypothesis; it would instead be just a vacuous notion that at best 
concealed ignorance of the true cause of life’s origin. 

Additionally, I knew from my Ph.D. work that there were other significant, if less 
quantifiable, problems with the idea that information-rich biomolecules had arisen by 
chance from a prebiotic soup. Most origin-of-life researchers recognized that, even if 
there had been a favorable prebiotic soup, many destructive chemical processes would 
have necessarily been at work at the same time. Simulation experiments of the type 
performed by Stanley Miller had repeatedly demonstrated this. They have invariably 
produced nonbiological substances in addition to biological building blocks such as 
amino acids. Without intelligent intervention, these other substances will react readily 
with biologically relevant building blocks to form biologically irrelevant 
compounds — chemically insoluble sludge. To prevent this from happening and to move 
the simulation of chemical evolution along a biologically promising trajectory, 
experimenters often remove those chemicals that degrade or transform amino acids into 
nonbiologically relevant compounds. They also must artificially manipulate the initial 
conditions in their experiments. For example, rather than using both short-and 
long-wavelength ultraviolet light, which would have been present in any realistic early 
atmosphere, they use only short-wavelength UV. Why? The presence of the 
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long-wavelength UV light quickly degrades amino acids. 



Zero Hour 

I began to wonder if the odds of life arising by chance alone, at least under the 
circumstances envisioned by advocates of the chance hypothesis, weren’t actually zero. 
Imagine that a casino owner invents a game in which the object is to roll 777 consecutive 
“sevens” with a set of dice. He asks the odds makers to calculate the chances of any one 
contestant winning. They are, of course, infinitesimally small. But now he gives the odds 
makers some additional information. The dice are made of white chocolate with dark 
chocolate spots on the faces, both of which will melt as the result of the glare of the lights 
over the game table and repeated handling by the game players. Now what are the odds 
of turning up 777 sevens in a row? Clearly, they have diminished further. In fact, they are 
not just effectively zero, but under these circumstances with destructive processes 
inevitably predominating they are actually zero. Seven hundred seventy-seven “sevens” 
will never appear, because the faces of the dice will be destroyed in the attempt to 
generate them. The destructive processes will ensure that the desired outcome will never 




occur. I wondered if the same problem didn’t afflict the chance hypothesis for the origin 
of life. 



In the face of these and other difficulties, most origin-of-life researchers have decided to 
consider other theories that do not rely heavily on chance. In the next chapter, I examine 
one of the other main contending approaches: self-organization. Since the odds of a 
purely random process producing life are “vanishingly small,” many scientists have 
concluded that some nonrandom, lawlike process must have been at work to help 
overcome these odds. If chance is insufficient, then perhaps “necessity” will do the job. 
Christian de Duve expresses the reasoning of researchers who favor this approach: “A 
string of improbable events — drawing the same lottery number twice, or the same bridge 
hand twice in a row — does not happen naturally. All of which lead me to conclude that 
life is an obligatory manifestation of matter, bound to arise where conditions are 
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appropriate. 

Nevertheless, I should note that a few theorists have attempted to either retain or 
resuscitate a role for chance. They have done so in one of two ways. Either they have 
attempted to lower the complexity threshold that random processes must first produce or 
postulated an increase in the probabilistic resources available to such processes. 

Some theorists, notably those proposing an initial “RNA world,” have sought to retain a 
role for chance by suggesting that natural selection might have played a key role in the 
origin of life, even before the origin of a fully functioning cell. They propose combining 
chance with natural selection (or other lawlike processes) as a way of explaining how the 
first cell arose. In doing so, they suggest that random processes would have had to 
produce much less biological information by chance alone. Once a self-replicating 
molecule or small system of molecules had arisen, natural selection would “kick in” to 
help produce the additional necessary information. In Chapter 14, 1 evaluate theories that 
have adopted this strategy. 

Others theorists have attempted to resuscitate chance theories altogether by postulating 
the existence of other possible universes beyond our own. In doing so, they have 
attempted to increase the probabilistic resources available for producing biological 
information and life itself. These other universes would presumably provide more 
opportunities to generate favorable environments in which random processes could 
generate the vast number of combinatorial possibilities necessary to give life a realistic 
chance of arising somewhere. Since this idea depends upon highly speculative and 
technical cosmological models, I have decided to evaluate it separately, in Appendix B. 
There I argue that although it is impossible to disprove the existence of other universes, 
postulating such universes does not actually provide either a satisfying explanation or, 
still less, the best explanation of the origin of biological information. As I explain, there 
are several reasons that such models fail to do so. For now, however, it is fair to say that 
most serious origin-of-life researchers have not found speculations about other possible 
universes terribly relevant to understanding what actually happened on the early earth to 
produce the first life on our planet. So in the next chapters, I turn my attention to 




evaluating the other leading contenders I have encountered in my investigation of the 
DNA enigma. 
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Self-Organization and Biochemical Predestination 



By 1968, the chance hypothesis was already suspect. The preceding fifteen years had 
revealed a cellular realm of stunning complexity, and though it would take several more 
decades to pinpoint the exact odds of a functioning protein or DNA molecule arising by 
chance alone, many scientists already had grown skeptical of chance as an explanation 
for the origin of life. Among them was a young biophysicist at San Francisco State 
University named Dean Kenyon. As he wrote a year later: “It is sometimes argued in 
speculative papers on the origin of life that highly improbable events (such as the 
spontaneous formation of a molecule of DNA and a molecule of DNA polymerase in the 
same region of space at the same time) become virtually inevitable over the vast stretches 
of geological time. No serious quantitative arguments, however, are given in support of 
such conclusions.” Instead, he argued, “such hypotheses are contrary to most of the 
available evidence.” 1 To emphasize the point Kenyon noted: “If the association of amino 
acids were a completely random event... there would not be enough mass in the entire 
earth, assuming it was composed exclusively of amino acids, to make even one molecule 
of every possible sequence of. . .a low-molecular- weight protein.” 

Kenyon began exploring a different approach. With a Ph.D. from Stanford and having 
worked under Melvin Calvin, a leading biochemist and origin-of-life researcher at the 
University of California, Berkeley, Kenyon was steeped in the scientific culture of the 
late 1960s. He was well aware of Jacques Monod’s conceptual dichotomy between 
chance and necessity. He understood that the logical alternative to chance was what 
Monod called necessity, the lawlike forces of physics and chemistry. If chance events 
couldn’t explain the origin of biological information, then, Kenyon thought, perhaps 
necessity could. Eventually he and a colleague, Gary Steinman, proposed just such a 
theory; except that they didn’t call it “necessity.” They called it 
“predestination” — biochemical predestination. In 1969 Kenyon and Steinman wrote a 
book by this title. Through the 1970s and early 1980s it became the bestselling 
graduate-level text on the origin of life and established Kenyon as a leading researcher in 
the field. 

In Biochemical Predestination, Kenyon and Steinman not only presented a new theory 
about the origin of biological information; they also inaugurated a fundamentally new 
approach to the origin-of-life problem, one that came to be called “self-organization.” In 
physics, the term “self-organization” refers to a spontaneous increase in the order of a 
system due to some natural process, force, or law. For example, as a bathtub drains, a 
highly ordered structure, a vortex, forms as the water swirls down the drain under the 
influence of gravity and other physical forces such as the Coriolis force. In this system, a 



force (gravity) acts on a system (the water in the draining tub) to generate a structure (the 
vortex), thus increasing the order of the system. Self-organizational theories of the origin 
of life try to attribute the organization in living things to physical or chemical forces or 
processes — ones that can be described mathematically as laws of nature. 

Of course, the self-organizational theorists recognized that they needed to do more than 
simply assert that life arose by physical or chemical necessity. They needed to identify a 
specific lawlike process (or series of processes) that could generate life or critical 
components of living cells starting from some specific set of conditions. In other words, 
self-organizational theorists needed to identify deterministic processes that could help 
overcome the otherwise long odds against the origin of life occurring by chance alone. 
And that is what Kenyon and those who followed in his wake set out to do. In the process 
Kenyon and other self-organizational theorists formulated theories that either tried to 
explain, or circumvented the need to explain, the DNA enigma. 



More Than the Sum 

What accounts for the difference between a chu nk of marble and a great sculpture? What 
explains the difference between a cloud and a message created from steam by a 
skywriter? What accounts for the difference between a functioning cell and the various 
molecules that jointly constitute it? These questions raise the classical philosophical issue 
of reductionism. Does the whole reduce to the sum of its parts? Or conversely, do the 
properties of the parts explain the structure and organization of the whole? As with many 
such questions, the best answer is, “It depends.” 

Consider a silver knife, fork, and spoon. The metal in all three has a definite set of 
chemical properties describable by physical and chemical laws. Silver atoms will react 
with certain atoms, under certain conditions, but not with others. Silver will melt at 
certain temperatures and pressures, but not at others. Silver has a certain strength and 
resistance to shearing. Insofar as these properties reliably manifest themselves under 
certain specified conditions, they can be described with general laws. Nevertheless, none 
of these properties of silver accounts for the shape of the various utensils and, therefore, 
the functions they perform. 

From a purely chemical point of view nothing discriminates the silver in a spoon from 
that in a knife or fork. Nor does the chemistry of these items explain their arrangement in 
a standard place setting with the fork on the left and the knife and spoon on the right. The 
chemistry of the silverware is indifferent to how the silver is arranged on the table. The 
arrangement is determined not by the properties of the silver or by chemical laws 
describing them, but by the choice of a rational agent to follow a human convention. 





Figure 11.1. Dean Kenyon. Prin ted by permission from Dean Kenyon. 



Yet one cannot say the same of every structure. Many objects display a structure and 
order that results from the chemical properties of their ingredients. For example, a crystal 
of salt has a lattice structure that exhibits a striking, highly repetitive pattern. This 
structure results from forces of mutual electrostatic attraction (describable by natural law) 
among the atoms in the lattice. Similarly, a spiral galaxy has a definite structure that 
results largely from lawlike forces of gravitational attraction among the stars in the 
galaxies. Structure does not always defy reduction to chemistry and physics. 

What about living cells and the complex molecules they contain? Does their organization 
derive from the physical and chemical properties of their parts? Do the chemical 
constituents of proteins or DNA molecules have properties that could cause them to 
self-organize? Are there physical or chemical forces that make the production of 
information-rich molecules inevitable under plausible prebiotic conditions? Dean Kenyon 
thought the answer to these questions might well be yes. 



Biochemical Predestination 

Like Aleksandr Oparin, whom he credited with inspiration, Kenyon sought to account for 
the origin of the first life in a series of gradual steps. Like Oparin, Kenyon constructed a 
scenario describing how life might have arisen through a series of chemical 
transformations in which more complex chemical structures arose from simpler ones. In 
Kenyon’s model, simple monomers (e.g., amino acids, bases, and sugars) arose from 
simpler atmospheric gases and energy; polymers (proteins and DNA) arose from 
monomers; primitive membranes formed around these polymers; and a primitive 
metabolism emerged inside these membranes as various polymers interacted chemically 
with one another. Unlike Oparin, however, who relied on chance variations to achieve 



some of the chemical transformations, Kenyon relied more exclusively on detenninistic 
chemical reactions. In his scenario, each stage along the way to the origin of life was 
driven by detenninistic chemical processes, including the most important stage, the origin 
of biological information. 

Whereas Oparin had suggested that the process by which monomers arranged themselves 
into polymers in the prebiotic soup was essentially random, Kenyon and Steinman 
suggested that forces of chemical necessity had performed the work. Specifically, they 
suggested that just as electrostatic forces draw sodium (Na+) and chloride (C1-) ions 
together into highly ordered patterns within a crystal of salt, amino acids with special 
affinities for each other might have arranged themselves to fonn proteins. As the two 
scientists explained, “In the same fashion that the difference in the nature of reactivity of 
the units of a growing inorganic crystal determines the final constitution of the 
three-dimensional crystal array, so differences in reactivities of the various amino acids 
with one another could possibly serve to promote a defined ordering of sequence in a 
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growing peptide chain.” 

Kenyon and Steinman first came to this idea while perfonning “di-mer” bonding 
experiments in the laboratory of Melvin Calvin at the University of California, Berkeley. 
They wanted to see whether specific amino acids bond more readily with some amino 
acids than others. Their experimental results suggested that they do. For example, they 
discovered that glycine forms linkages with alanine twice as frequently as glycine with 
valine. Moreover, they discovered that these differences in chemical affinity seemed 
related to differences in chemical structure. Amino acids with longer side chains bond 
less frequently to a given amino acid than do amino acids with shorter side chains. 4 
Kenyon and Steinman summarized their findings in a table showing the various 
differential bonding affinities they discovered. 

In the wake of these findings, Kenyon and Steinman proposed that these differences in 
affinity imposed constraints on the sequencing of amino acids, rendering certain 
sequences more likely than others. As they put it, “It would appear that the unique nature 
of each type of amino acid as detennined by its side chain could introduce nonrandom 
constraints into the sequencing process.” 5 They further suggested that these differences in 
affinity might correlate with the specific sequencing motifs typical in functional proteins. 
If so, the properties of individual amino acids could themselves have helped to sequence 
the amino-acid chains that gave rise to functional proteins, thereby producing the 
information the chains contain. Biochemical necessity could thus explain the seeming 
improbability of the functional sequences that exist in living organisms today. Sequences 
that would be vastly improbable in the absence of differing bonding affinities might have 
been very probable when such affinities are taken into account. 

Kenyon and Steinman did not attempt to extend this approach to explain the information 
in DNA, since they favored a protein- first model. 6 They knew that proteins perform most 
of the important enzymatic and structural functions in the cell. They thought that if 
functional proteins could arise without the help of nucleic acids, then initially they need 




not explain the origin of DNA and RNA and the infonnation they contained. Instead, they 
envisioned proteins arising directly from amino acids in a prebiotic soup. They then 
envisioned some of these proteins (or “proteinoids”) fonning membranes surrounding 
other proteins. Thus, the two were convinced that only later, once primitive metabolic 
function had arisen, did DNA and RNA need to come on the scene. Kenyon and 
Steinman did not speculate as to how this had happened, content as they were to think 
they may have solved the more fundamental question of the origin of biological 
information by showing that it had arisen first in proteins. 

Doubts About Self-Organization 

Did Kenyon and Steinman solve the DNA enigma? Was the information necessary to 
produce the first life “biochemically predestined” to arise on the early earth? 
Surprisingly, even as their bold self-organizational model grew in popularity among 
origin-of-life researchers, Kenyon himself began to doubt his own theory. 

Kenyon’s doubts first surfaced in discussions with one of his students at San Francisco 
State University. In the spring of 1975 near the end of a semester-long upper-division 
course on evolution, a student began to raise questions about the plausibility of chemical 
evolution. 8 The student — ironically named Solomon Darwin — pressed Kenyon to 
examine whether his self-organizational model could explain the origin of the 
information in DNA. Kenyon might have deflected this criticism by asserting that his 
protein-first model of self-organization had circumvented the need to explain the 
information in DNA. But by this point he found himself disinclined to make that defense. 

For some time Kenyon himself had suspected that DNA needed to play a more central 
role in his account of the origin of life. He realized that whether functional proteins had 
arisen before DNA or not, the origin of infonnation-rich DNA molecules still needed 
explanation, if for no other reason than because information-rich DNA molecules exist in 
all extant cells. At some point, DNA must have arisen as a carrier of the information for 
building proteins and then come into association with functional proteins. One way or 
another, the origin of genetic information still needed to be explained. 

Now he faced a dilemma. Having opted for a self-organizational approach, he had only 
two options for explaining the infonnation in DNA. Either (a) the specific sequences of 
amino acids in proteins had somehow provided a template for sequencing the bases in 
newly fonning DNA molecules or (b) DNA itself had self-organized in much the same 
way he and Steinman supposed proteins had. As he reflected more on Solomon Darwin’s 
challenge, Kenyon realized that neither option was very promising. First, Kenyon knew 
that to propose that the infonnation in proteins had somehow directed the construction of 
DNA would be to contradict everything then kn own about molecular biology. In extant 
cells, DNA provides the template of information for building proteins and not the reverse. 
Infonnation flows from DNA to proteins. Moreover, there are several good reasons for 
this asymmetry. Each triplet of DNA bases (and conesponding RNA codons) specifies 
exactly one amino acid during transcription and translation. Yet most amino acids 




correspond to more than one nucleotide triplet or RNA codon. This feature of the genetic 
code ensures that infonnation can flow without “degeneracy,” or loss of specificity, in 
only one direction, from DNA to proteins and not the reverse. 

Additionally, Kenyon realized that for structural and chemical reasons, proteins made 
poor candidates for replicators — molecules that can function as easily copied 
informational templates. Unlike DNA, proteins do not possess two antiparallel strands of 
identical infonnation and thus cannot be unwound and copied in the way DNA can. 
Further, proteins are highly reactive once they are unwound (due to exposed amino and 
carboxl groups and exposed side chains). For this reasons, most “denatured” (unwound) 
proteins tend to cross-link and aggregate. Others are quickly destroyed in the cell. Either 
way, denatured proteins tend to lose their structural stability and function. Moreover, they 
do not regain their original three-dimensional shape or activity once they lose it. By 
contrast, DNA is a stable, chemically inert molecule that easily maintains its chemical 
structure and composition while other molecules copy its infonnation. For all these 
reasons, it seemed difficult to envision proteins serving as replicators of their own stored 
information. Indeed, as Kenyon later told me, “getting the information out of proteins and 
into DNA” would pose an insuperable conceptual hurdle. 

And there was another difficulty. By the late 1980s new empirical findings challenged the 
idea that amino-acid bonding affinities had produced the biological information in 
proteins. Although Kenyon and Steinman had shown that certain amino acids form 
linkages more readily with some amino acids than with others, 9 new studies showed that 
these differential affinities do not correlate with actual sequencing patterns in large 
classes of known proteins. 10 In other words, differential bonding affinities exist, but they 
don’t seem to explain (or to have determined) the specific sequences of amino acids that 
actual proteins now possess. Instead, there was a much more plausible explanation of 
these new findings, one consistent with the Crick’s central dogma. The amino-acid 
sequences of known proteins had been generated by the infonnation encoded in 
DNA — specifically, by the genetic infonnation carried in the DNA of the organisms in 
which these proteins reside. After all, proteins in extant cells are produced by the 
gene-expression system. Of course, someone could argue that the first proteins arose 
directly from amino acids, but now it was clear there was, at the very least, no evidence 
of that in the sequences of amino acids in known proteins. 

All this would later reinforce Kenyon’s conviction that he could not circumvent the need 
to explain the origin of information in DNA by positing a protein-first model. He would 
have to confront the DNA enigma. 

Kenyon realized that if the information in DNA did not arise first in proteins, it must 
have arisen independently of them. This meant that the base sequences of DNA (and the 
molecule itself) must have self-organized under the influence of chemical laws or forces 
of attraction between the constituent monomers. Yet based upon his knowledge of the 
chemical structure of the DNA molecule, Kenyon doubted that DNA possessed any 
self-organizational properties analogous to those he had identified in amino acids and 




proteins. He was strengthened in this conclusion by reading an essay about DNA by a 
distinguished Hungarian-born physical chemist and philosopher of science named 
Michael Polanyi. As it happened, this same essay would shape my own thinking about 
the viability of self-organizational theories as well. 



Life Transcending Physics and Chemistry 

I first encountered Dean Kenyon in 1985, sixteen years after the publication of 
Biochemical Predestination. I was in the audience in Dallas during the presentation in 
which Kenyon announced that he had come to doubt all current chemical evolutionary 
theories of the origin of life — including his own. Thus, when I arrived in Cambridge, I 
already knew that one of the leading proponents of “self-organization” had repudiated his 
own work. Nevertheless, I did not yet fully appreciate why he had done so. I learned from 
Charles Thaxton that Kenyon had begun to doubt that his protein-first models eliminated 
the need to explain the origin of infonnation in DNA. But I didn’t yet see why explaining 
the DNA enigma by reference to self-organizing chemical laws or forces would prove so 
difficult. 

That began to change as I encountered the work of Michael Polanyi. In 1968, Polanyi 
published a seminal essay about DNA in the journal Science, “Life’s Irreducible 
Structure,” and another essay in the Chemical and Engineering News the year before, 
“Life Transcending Physics and Chemistry.” During my first year of graduate study, I 
wrote a paper about the two essays and discussed them at length with my supervisor, 
Harmke Kamminga, since it seemed to me that Polanyi ’s insights had profound 
implications for self-organizational models of the origin of life. 

Polanyi ’s essays did not address the origin of life question directly, but rather a classical 
philosophical argument between the “vitalists” and “reductionists” (discussed in Chapter 
2). Recall, that for centuries, vitalists and reductionists had argued about whether a 
qualitative distinction existed between living and nonliving matter. Vitalists maintained 
that living organisms contained some kind of immaterial “vital force” or spirit, an elan 
vital that distinguished them qualitatively from nonliving chemicals. Reductionists, for 
their part, held that life represented merely a quantitatively more complex form of 
chemistry. Thus, in their view, living organisms, like complex machines, functioned as 
the result of processes that could be “reduced” or explained solely by reference to the 
laws of physics and chemistry. 

The molecular biological revolution of the 1950s and 1960s seemed to confirm the 
reductionist perspective. 11 The newly discovered molecular mechanisms for storing and 
transmitting information in the cells confirmed for many biologists that the distinctive 
properties of life could, as Francis Crick put it in 1966, “be explained in terms of the 
ordinary concepts of physics and chemistry or rather simple extensions of them.” As 
Richard Dawkins later wrote, the discovery of DNA’s role in heredity “dealt the final, 
killing blow to the belief that living material is deeply distinct from nonliving 
material.” 13 




But had it really? Even if biochemists were no longer looking for some mysterious life 
force, was it really clear that living things could be explained solely by reference to the 
laws of physics and chemistry? 

Polanyi’s answer turned the classical reductionism-vitalism debate on its head. He did 
this by challenging an assumption held by reductionists and vitalists alike, namely, that 
“so far as life can be represented as a mechanism, it [can be] explained by the laws of 
inanimate nature.” 14 Whereas vitalists had argued against reductionism by contesting that 
life can be understood mechanistically, Polanyi showed that reductionism fails even if 
one grants that living organisms depend upon many mechanisms and machines. To show 
this, Polanyi argued that even if living organisms function like machines, they cannot be 
fully explained by reference to the laws of physics and chemistry. 15 

Consider an illustration. A 1960s vintage computer has many parts, including transistors, 
resistors, and capacitors. The electricity flowing through these various parts conforms to 
the laws of electromagnetism, for example, Ohm’s law (E = IR, or voltage equals current 
times resistance). Nevertheless, the specific structure of the computer, the configuration 
of its parts, does not result from Ohm’s or any other law. Ohm’s law (and, indeed, the 
laws of physics generally) allows a vast ensemble of possible configurations of the same 
parts. Given the fundamental physical laws and the same parts, an engineer could build 
many other machines and structures: different model computers, radios, or quirky pieces 
of experimental art made from electrical components. The physical and chemical laws 
that govern the flow of current in electrical machines do not detennine how the parts of 
the machine are arranged and assembled. The flow of electricity obeys the laws of 
physics, but where the electricity flows in any particular machine depends upon the 
arrangement of its parts — which, in turn, depends on the design of an electrical engineer 
working according to engineering principles. And these engineering principles, Polanyi 
insisted, are distinct from the laws of physics and chemistry that they harness. 

Polanyi demonstrated that the same thing was true of living things. He did this by 
showing that communications systems, like machines, defy reduction to physical and 
chemical law and by showing further that living organisms contain a communications 
system, namely, the gene-expression system in which DNA stores information for 
building proteins. 

Polanyi argued that, in the case of communications systems, the laws of physics and 
chemistry do not determine the arrangements of the characters that convey information. 
The laws of acoustics and the properties of air do not determine which sounds are 
conveyed by speakers of natural languages. Neither do the chemical properties of ink 
detennine the arrangements of letters on a printed page. Instead, the laws of physics and 
chemistry allow a vast array of possible sequences of sounds, characters, or symbols in 
any code or language. Which sequence of characters is used to convey a message is not 
detennined by physical law, but by the choice of the users of the communications system 
in accord with the established conventions of vocabulary and grammar — just as engineers 




determine the arrangement of the parts of machines in accord with the principles of 
engineering. 

Thus, Polanyi concluded, communications systems defy reduction to physics and 
chemistry for much the same reasons that machines do. Then he took a step that made his 
work directly relevant to the DNA enigma: he insisted that living things defy reduction to 
the laws of physics and chemistry because they also contain a system of 
communications — in particular, the DNA molecule and the whole gene-expression 
system. Polanyi argued that, as with other systems of communication, the lower-level 
laws of physics and chemistry cannot explain the higher-level properties of DNA. DNA 
base sequencing cannot be explained by lower-level chemical laws or properties any 
more than the infonnation in a newspaper headline can be explained by reference to the 
chemical properties of ink . 16 Nor can the conventions of the genetic code that detennine 
the assignments between nucleotide triplets and amino acids during translation be 
explained in this manner. Instead, the genetic code functions as a higher-level constraint 
distinct from the laws of physics and chemistry, much like a grammatical convention in a 
human language. 

Polanyi went further, arguing that DNA’s capacity to convey information actually 
requires a freedom from chemical determinism or constraint, in particular, in the 
arrangement of the nucleotide bases. He argued that if the bonding properties of 
nucleotides determined their arrangement, the capacity of DNA to convey information 
would be destroyed. In that case, the bonding properties of each nucleotide would 
detennine each subsequent nucleotide and thus, in turn, the sequence of the molecular 
chain. Under these conditions, a rigidly ordered pattern would emerge as required by 
their bonding properties and then repeat endlessly, fanning something like a crystal. If 
DNA manifested such redundancy, it would be impossible for it to store or convey much 
information. As Polanyi concluded, “Whatever may be the origin of a DNA 
configuration, it can function as a code only if its order is not due to the forces of 
potential energy. It must be as physically indetenninate as the sequence of words is on a 
printed page.” 8 

DNA and Self-Organization 

Polanyi’s argument made sense to me. DNA, like other communication systems, conveys 
information because of very precise configurations of matter. Was there a law of physics 
or chemistry that detennined these exact arrangements? Were there chemical forces 
dictating that only biologically functional base sequences and no others could exist 
between the strands of the double helix? After reading Polanyi’s essays, I doubted this. 

I realized that his argument also had profound implications for self-organizational 
theories of the origin of life. To say that the infonnation in DNA does not reduce to or 
derive from physical and chemical forces implied that the information in DNA did not 
originate from such forces. If so, then there was nothing Kenyon could do to salvage his 
self-organizational model. 




But was Polanyi correct? How did we know that the constituent parts of DNA did not 
possess some specific bonding affinities of the kind that Kenyon and Steinman had 
discovered in amino acids? Polanyi argued largely on theoretical grounds that the 
nucleotide bases could not possess such deterministic affinities and still allow DNA to 
store and transmit information. How do we know that such affinities do not exist? 

In 1987 when I first encountered Polanyi’s argument, I did not yet understand the 
chemical structure of DNA well enough to answer this question. And Polanyi, who no 
doubt did, did not bother to explain it in his articles. So the question lingered in the back 
of my mind for several years after completing my Ph.D. Not until the mid-1990s, when I 
began to make a more systematic evaluation of self-organizational theories, did I find an 
answer to it. 

I remember vividly the day the breakthrough came. I was listening to a colleague, a 
biologist at Whitworth College, teach a college class about the discovery of the double 
helix when I noticed something about the chemical structure of DNA on the slide that she 
had projected on the screen. What I noticed wasn’t anything I hadn’t seen before, but 
somehow its significance had previously escaped me. It not only confirmed for me 
Polanyi’s conclusion about the information in DNA transcending physics and chemistry, 
but it also convinced me that self-organizational theories invoking bonding affinities or 
forces of attraction would never explain the origin of the information that DNA contains. 

Figure 1 1 .2 shows what I saw on the slide that suddenly seized my attention. It portrays 
the chemical structure of DNA, including the chemical bonds that hold the molecule 
together. There are bonds, for example, between the sugar and the phosphate molecule 
forming the two twisting backbones of the DNA molecule. There are bonds fixing 
individual nucleotide bases to the sugar-phosphate backbones on each side of the 
molecule. There are also hydrogen bonds stretching horizontally across the molecule 
between nucleotide bases, forming complementary pairs. The individually weak 
hydrogen bonds, which in concert hold two complementary copies of the DNA message 
text together, make replication of the genetic instructions possible. But notice too that 
there are no chemical bonds between the bases along the longitudinal axis in the center of 
the helix. Yet it is precisely along this axis of the DNA molecule that the genetic 
information is stored. 





Figure 11.2. Model of the chemical structure of the DNA molecule depicting the main 
chemical bonds between its constituent molecules. Note that no chemical bonds link the 
nucleotide bases (designated by the letters in boxes) in the longitudinal message-bearing 
axis of the molecule. Note also that the same kind of chemical bonds link the different 
nucleotide bases to the sugar-phosphate backbone of the molecule (denoted by pentagons 
and circles). These two features of the molecule ensure that any nucleotide base can 
attach to the backbone at any site with equal ease, thus showing that the properties of the 
chemical constituents of DNA do not determine its base sequences. Adapted by 
permission from an original drawing by Fred Hereen. 





There in the classroom this elementary fact of DNA chemistry leaped out at me. I 
realized that explaining DNA’s infonnation-rich sequences by appealing to differential 
bonding affinities meant that there had to be chemical bonds of differing strength 
between the different bases along the infonnation-bearing axis of the DNA molecule. 
Yet, as it turns out, there are no differential bonding affinities there. Indeed, there is not 
just an absence of differing bonding affinities; there are no bonds at all between the 
critical information-bearing bases in DNA. 19 In the lecture hall the point suddenly struck 
me as embarrassingly simple: there are neither bonds nor bonding affinities — differing in 
strength or otherwise — that can explain the origin of the base sequencing that constitutes 
the infonnation in the DNA molecule. A force has to exist before it can cause something. 
And the relevant kind of force in this case (differing chemical attractions between 
nucleotide bases) does not exist within the DNA molecule. 

Of course it might be argued that although there are no bonds between the bases that 
explain the arrangement of bases, there might be either something else about the different 
sites on the DNA molecule that inclines one base rather than another to attach at one site 
rather than another. I investigated the possibility with a colleague of mine named Tony 
Mega, an organic chemist. I asked him if there were any physical or chemical differences 
between the bases or attachment sites on DNA that could account for base sequencing. 
We considered this question together for a while, but soon realized that there are no 
significant differential affinities between any of the four bases and the binding sites along 
the sugar-phosphate backbone. Instead, the same type of chemical bond (an N-glycosidic 
bond) occurs between the base and the backbone regardless of which base attaches. All 
four bases are acceptable; none is chemically favored. 

This meant there was nothing about either the backbone of the molecule or the way any 
of the four bases attached to it that made any sequence more likely to form than another. 
Later I found that the noted origin-of-life biochemist Bernd-Olaf Kuppers had concluded 
much the same thing. As he explained, “The properties of nucleic acids indicate that all 
the combinatorially possible nucleotide patterns of a DNA are, from a chemical point of 
view, equivalent.”" 0 In sum, two features of DNA ensure that “self-organizing” bonding 
affinities cannot explain the specific arrangement of nucleotide bases in the molecule: (1) 
there are no bonds between bases along the information-bearing axis of the molecule and 
(2) there are no differential affinities between the backbone and the specific bases that 
could account for variations in sequence. 

While I was teaching I developed a visual analogy to help my students understand why 
these two features of the chemical structure of DNA have such devastating implications 
for self-organizational models, at least those that invoke bonding affinities to explain the 
DNA enigma. When my children were young, they liked to spell messages on the 
metallic surface of our refrigerator using plastic letters with little magnets on the inside. 
One day I realized that the communication system formed by the refrigerator and 
magnetic letters had something in common with DNA — something that could help me 
explain why the information in DNA is irreducible to physical or chemical forces of 
attraction. 




To demonstrate this to my students, I would bring a small magnetic “chalkboard” to class 
with a message spelled on it, such as “Biology Rocks!” using the same kind of plastic 
letters that my kids used at home. I would point out that the magnetic forces between the 
letters and the metallic surface of the chalkboard explain why the letters stick to the 
board, just as forces of chemical attraction explain why the nucleotides stick to the 
sugar-phosphate backbone of the DNA. But I would also point out that there are no 
significant forces of attraction between the individual letters that determine their 
arrangement, just as there are no significant forces of attraction between the bases in the 
information-bearing axis of the DNA molecule. Instead, the magnetic forces between the 
letters and the chalkboard allow numerous possible letter combinations, some of which 
convey functional or meaningful information and most of which do not. 

To demonstrate that the magnetic forces do not dictate any specific letter sequence, I 
arranged and rearranged the letters on the board to show that they have perfect physical 
freedom to spell other messages or mere gibberish. I would further note that there are no 
differential forces of attraction at work to explain why one letter sticks to the chalkboard 
at one location rather than another. I then pointed out that the same is true of DNA: there 
are no differential forces of attraction between the DNA bases and the sites on the 
sugar-phosphate backbone. In the case of the magnetic board nothing about the magnetic 
force acting on the “B” inclines it to attach to the board at the front of the “Biology 
Rocks!” sequence rather than at the back or middle. Instead, each letter can attach to the 
chalkboard at any location, just as any one of the nucleotide bases can attach to any sugar 
molecule along the sugar-phosphate backbone of DNA. Forces of attraction 
(N-glycosidic bonds) do explain why the bases in DNA attach to the backbone of the 
molecules, but they do not explain why any given nucleotide base attaches to the 
molecule at one site rather than another. Nor, given the absence of chemical bonds 
between the bases, do any other bonding affinities or chemical forces internal to the 
molecule explain the origin of DNA base sequencing. 

I later learned that Kenyon already had reckoned with these same stubborn facts of 
molecular biology and reached the same conclusion about the futility of explaining the 
arrangement of DNA bases by reference to internal bonding affinities. The properties of 
the building blocks of DNA simply do not make a particular gene, let alone life as we 
know it, inevitable. Yet the opposite claim is often made by self-organizational theorists, 
albeit without much specificity. De Duve states, for example, that “the processes that 
generated life” were “highly deterministic,” making life as we know it “inevitable,” given 
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“the conditions that existed on the prebiotic earth.” Yet imagine the most favorable 
prebiotic conditions. Imagine a pool of all four DNA bases and all necessary sugars and 
phosphates. Would any particular genetic sequence inevitably arise? Given all necessary 
monomers, would any particular functional protein or gene, let alone a specific genetic 
code, replication system, or signal transduction circuitry, inevitably arise? Clearly not. 

The most obvious place to look for self-organizing properties to explain the origin of 
genetic information is in the constituent parts (the monomers) of the molecules that carry 
that information; but biochemistry makes clear that forces of attraction between the bases 




in DNA do not explain the specific sequences in these large information-bearing 
molecules. Because the same is true of RNA, researchers who speculate that life began in 
a self-organizing RNA world also must confront another sequencing problem, in 
particular, the problem of explaining how infonnation in a functioning RNA molecule 
could have first arisen in the absence of differential forces of chemical attraction between 
the bases in RNA 23 



The Mystery of the Code 

Recall from Chapter 5 that, in addition to the specified information in nucleic acids and 
proteins, origin-of-life researchers need to account for the origin of the integrated 
complexity of the gene-expression and-translation system. Recall also that the 
gene-expression system not only utilizes the digital information inscribed along the spine 
of the DNA molecule (the genetic text); it also depends on the genetic code or translation 
system imbedded in the tRNA molecule (along with its associated synthetase proteins). 
(The genetic code is to the genetic information on a strand of DNA as the Morse code is 
to a specific message received by a telegraph operator.) 

It turns out that it is just as difficult to explain the origin of the genetic code by reference 
to self-organizational bonding affinities as it is to explain the origin of the genetic “text” 
(the specific sequencing of the DNA bases). The next several paragraphs describe why in 
some detail. Those unfamiliar with the relevant facts may find it useful to consult the 
accompanying figure (Figure 11.3) or to skip ahead and pick up the main thread of the 
argument beginning with the heading “The Necessity of Freedom.” 

Self-organizational theories have failed to explain the origin of the genetic code for 
several reasons. First, to explain the origin of the genetic code, scientists need to explain 
the origin of the precise set of correspondences between specific nucleotide triplets in 
DNA (or codons on the messenger RNA) and specific amino acids (carried by transfer 
RNA). Yet molecular biologists have failed to find any significant chemical interaction 
between the codons on mRNA (or the anticodons on tRNA) and the amino acids on the 
acceptor arm of tRNA to which the codons correspond. This means that forces of 
chemical attraction between amino acids and these groups of bases do not explain the 
correspondences that constitute the genetic code. 
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Figure 11.3. The transfer-RNA molecule showing the anticodon on one end of the 
molecule and an attached amino acid on the other. Notice that there is no direct chemical 
interaction between the amino acid and the nucleotide codon that specifies it. 



Instead, the mRNA codon binds not to the amino acid directly, but to the anticodon triplet 
on the tRNA molecule. Moreover, the anti-codon triplet and amino acid are situated at 
opposite ends of tRNA. They do not interact chemically in any direct way. Although 
amino acids do interact chemically with a nucleotide triplet at the 3’ acceptor end of the 
tRNA molecule, the triplet remains the same in all twenty tRNA molecules, meaning the 
bonds there do not display any differential affinities that could explain the differing 
amino-acid assignments that constitute the code. All twenty tRNA molecules have the 
same final triplet of bases (ACC) at the 3’ arm where their amino acids attach. Since all 
twenty amino acids in all twenty tRNA molecules attach to the same nucleotide sequence, 
the properties of that nucleotide sequence clearly do not determine which amino acids 
attach and which do not. The nucleotide sequence is indifferent to which amino acid 
binds to it (just as the sugar-phosphate backbone in DNA is indifferent to which 
nucleotide base binds to it). All twenty triplets are acceptable; none is preferred. 

Thus, chemical affinities between nucleotide codons and amino acids do not determine 
the correspondences between codons and amino acids that define the genetic code. From 
the standpoint of the properties of the constituents that comprise the code, the code is 



physically and chemically arbitrary. All possible codes are equally likely; none is favored 
chemically. 

Moreover, the discovery of seventeen variant genetic codes has put to rest any doubt 
about this. - The existence of many separate codes (multiple sets of codon-amino acid 
assignments) in different microorganisms indicates that the chemical properties of the 
relevant monomers allow more than a single set of codon-amino acid assignments. The 
conclusion is straightforward: the chemical properties of amino acids and nucleotides do 
not determine a single universal genetic code; since there is not just one code, “it” cannot 
be inevitable. 

Instead, scientists now know the codon-amino acid relationships that define the code are 
established and mediated by the catalytic action of some twenty separate proteins, the 
so-called aminoacyl-tRNA synthetases (one for each tRNA anticodon and amino-acid 
pair). Each of these proteins recognizes a specific amino acid and the specific tRNA with 
its corresponding anticodon and helps attach the appropriate amino acid to that tRNA 
molecule. Thus, instead of the code reducing to a simple set of chemical affinities 
between a small number of monomers, biochemists have found a functionally 
interdependent system of highly specific biopolymers, including mRNA, twenty specific 
tRNAs, and twenty specific synthetase proteins, each of which is itself constructed via 
information encoded on the very DNA that it helps to decode. But such integrated 
complexity was just what needed explanation in the first place. The attempt to explain 
one part of the integrated complexity of the gene-expression system, namely, the genetic 
code, by reference to simple chemical affinities leads not to simple rules of chemical 
attraction, but instead to an integrated system of large molecular components. One aspect 
of the DNA enigma leads to another. 

Certainly, the chemical interactions between the functional polymers in this complex 
translation system proceed deterministically. But to explain how a windmill or an 
operating system or a genetic code works is one thing; to explain how any of them 
originated is quite another. To claim that deterministic chemical affinities made the origin 
of this system inevitable lacks empirical foundation. Given a pool of the bases necessary 
to tRNA and mRNA, given all necessary sugars and phosphates and all twenty amino 
acids used in proteins, would the molecules comprising the current translation system, let 
alone any particular genetic code, have had to arise? Indeed, would even a single 
synthetase have had to arise from a pool of all the necessary amino acids? Again, clearly 
not. 

As origin-of-life biochemists have taught us, monomers are “building blocks.” And like 
the building blocks that masons use, molecular building blocks can be arranged and 
rearranged in innumerable ways. The properties of stone blocks do not detennine their 
arrangement in the construction of buildings. Similarly, the properties of biological 
building blocks do not determine the arrangement of monomers in functional DNA, 
RNA, or proteins. Nor do they determine the correspondences between DNA bases and 
the amino acids that constitute the genetic code. Instead, the chemical properties of the 




building blocks of these molecules allow a vast ensemble of possible configurations and 
associations, the overwhelming majority of which would have no biological function. 
Thus, functional genes and proteins are no more inevitable, given the properties of their 
“building blocks,” than the palace of Versailles was inevitable, given the properties of the 
bricks and stone used to construct it. 



The Necessity of Freedom 

As I thought more about the chemical structure of DNA and tRNA, Polanyi’s deeper 
point about the expression of infonnation requiring chemical indeterminacy, or freedom, 
came into clearer focus. Polanyi had argued that if “forces of potential energy” 
detennined the arrangement of the bases, “the code-like character” of the molecule 
“would be effaced by an overwhelming redundancy .” 25 I now understood why. 

Consider, for example, what would happen if the individual nucleotide bases (A, C, G, T) 
in the DNA molecule did interact by chemical necessity (along the information-bearing 
axis of DNA). Suppose that every time adenine (A) occurred in a growing genetic 
sequence, it attracted cytosine (C) to it, which attracted guanine (G), which attracted 
thymine (T), which attracted adenine (A), and so on. If this were the case, the 
longitudinal axis of DNA would be peppered with repetitive sequences of ACGT. Rather 
than being a genetic molecule capable of virtually unlimited novelty and characterized by 
unpredictable and aperiodic sequences, DNA would contain sequences awash in 
repetition or redundancy — much like the arrangement of atoms in crystals. 

To see why, imagine a group of captive soldiers are told they can type letters to their 
families at home. The only condition is that they have to begin with the lower case letter 
a, follow it with the next letter in the alphabet, b, then the next and the next, moving 
through the entire alphabet, circling back to a at the end and then continuing the process 
until they have filled in the sheet of typing paper. They are instructed to follow the same 
process for filling out the front of the envelope. Finally, if they don’t feel up to the task or 
if they accidentally strike a wrong key, the prison guards will take over the task for them 
and “do it properly.” 

This would, of course, be nothing more than a cruel joke, for the soldiers couldn’t 

communicate a single bit of information with their letters or even mail the letters to the 

appropriate addresses — all because the content of the letters was inevitable, strictly 

governed by the lawlike algorithm forced on them by the prison guards. In the same way, 

the lawlike forces of chemical necessity produce redundancy (repetition), which reduces 

the capacity to convey information and express novelty. Instead, information emerges 

from within an environment marked by indetenninacy, by the freedom to arrange parts in 

many different ways. As the MIT philosopher Robert Stalnaker puts it, information 
• • 27 

content “requires contingency.” 



Infonnation theory reveals a deeper reason for this. Recall that classical information 
theory equates the reduction of uncertainty with the transmission of infonnation, whether 




specified or unspecified. It also equates improbability and infonnation — the more 
improbable an event, the more infonnation its occurrence conveys. In the case that a 
law-like physical or chemical process determines that one kind of event will necessarily 
and predictably follow another, then no uncertainty will be reduced by the occurrence of 
such a high-probability event. Thus, no infonnation will be conveyed. Philosopher of 
science Fred Dretske, the author of an influential book on infonnation theory, explains it 
this way: “As p(si) [the probability of a condition or state of affairs] approaches 1 [i.e., 
certainty], the amount of infonnation associated with the occunence of si goes to 0. In 
the limiting case when the probability of a condition or state of affairs is unity [p(si) =1], 
no infonnation is associated with, or generated by, the occurrence of si. This is merely 
another way to say that no infonnation is generated by the occurrence of events for which 
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there are no possible alternatives.” 

Dretske’s and Polanyi’s observations were decisive: to the extent that forces of attraction 
among the members of a sequence determine the anangement of the sequence, the 
information-carrying capacity of the system will be diminished or effaced by 
redundancy. Bonding affinities, to the extent they exist, inhibit the production of 
information, because they detennine that specific outcomes will follow specific 
conditions with high probability. Information-carrying capacity is maximized, 
however, when the opposite situation occurs, namely, when chemical conditions allow 
many improbable outcomes. 

Polanyi appreciated this point, but also its converse. He knew that it was precisely 
because the sequences of bases in DNA were not biochemically determined (or 
predestined) that the molecule could store and transmit information. Because any 
nucleotide can follow any other, a vast array of sequences is possible, which allows DNA 
to encode a vast number of protein functions. As he explains, “It is this physical 
indeterminacy of the sequence that produces the improbability of occurrence of any 
particular sequence and thereby enables it to have a meaning — a meaning that has a 
mathematically detenninate infonnation content equal to the numerical improbability of 
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the arrangement.” 

As noted in Chapter 4, the base sequences in DNA not only possess infonnation-carrying 
capacity as measured by classical Shannon infonnation theory, they also store 
functionally specified information; they are specified as well as complex. Clearly, 
however, a sequence cannot be both specified and complex if it is not at least complex. 
Therefore, self-organizational forces of chemical necessity, which produce redundant 
order and preclude complexity, preclude the generation of specified complexity (or 
specified information) as well. Lawlike chemical forces do not generate complex 
sequences. Thus, they cannot be invoked to explain the origin of infonnation, whether 
specified or otherwise. 



Conclusion 




At a small private conference in 1993, I had a chance to meet and talk with Kenyon 
personally for the first time. I learned that he had come to the same conclusion about his 
theory of “biochemical predestination” as I had. It now seemed clear to both of us that 
there was a significant, in principle, objection to the very idea that chemical attractions 
could produce infonnation as opposed to simple redundant order. Indeed, if Kenyon had 
found that bonding affinities between nucleotide bases in DNA determined their 
sequencing, he would have also discovered that chemists had been mistaken about 
DNA’s information-bearing properties. Yet no one doubted DNA’s capacity to store 
information. 

But were there other such theories that could explain what Kenyon’s could not? 
Self-organizational models emphasizing internal bonding affinities had failed, but 
perhaps models emphasizing lawlike forces external to DNA would succeed. And 
although simple algorithms like “repeat ATCG” clearly lacked the capacity to convey 
biological infonnation, perhaps some far more sophisticated algorithm or some dance of 
forces between a large group of molecules could cause life (or the infonnation it 
required) to self-organize. By the early 1990s, some researchers were proposing models 
of just this kind. 
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Thinking Outside the Bonds 



There’s an oft-told tale, apocryphal but nonetheless instructive, of Christopher Columbus 
dining with several men who are unimpressed by his discovery of the New World. 
Columbus, not surprisingly, objects to their disdain. Certainly the way to the New World 
is obvious now, but if finding the way had been so easy, why hadn’t someone done it 
before? To illustrate what he’d achieved, he asks for an egg and challenges the men at the 
table to balance it on its head. Each man tries and fails. Columbus then takes the egg and 
offers a simple solution: he sets the egg down just hard enough to break the end, allowing 
it to remain upright. 

Today we have an expression to describe innovative thinking that transcends 
unnecessary, but established constraints. We say that innovative problem solvers “thi nk 
outside the box.” Following Kenyon and Polanyi, I had come to see that differences in 
internal bonding affinity, either between nucleotide bases or amino acids, did not and 
could not solve the DNA enigma. But if the internal affinities were inside the box, both 
proverbially and literally, perhaps the failure of models relying on such forces merely 
signaled the need for researchers to journey outside the box. And that’s precisely what 
subsequent self-organizational researchers tried to do. 



External Self-Organizational Forces: Just Add Energy? 

If internal bonding affinities did not explain the information in DNA and proteins, might 
there be some ubiquitous external force that caused the bases in DNA (or amino acids in 
proteins) to align themselves into information-rich sequences? Magnetic forces cause iron 
filings to align themselves into orderly “lines of force” around the magnet. Gravitational 
forces create vortices in draining bathtubs. Perhaps some pervasive self-organizing forces 
external to DNA and proteins could explain the origin of the information-rich 
biomolecules or other forms of biological organization. 

In 1977 the Russian-born Belgian physicist Ilya Prigogine wrote a book with a colleague, 
Gregoire Nicolis, exploring this possibility. Prigogine specialized in thermodynamics, the 
science of energy and heat. He became interested in how energy flowing into a system 
could cause order to arise spontaneously. His work documenting this phenomenon won 
him the Nobel Prize in Chemistry in 1977, the same year he published his book with 
Nicolis. 

In their book Self-Organization in Nonequilibrium Systems, Prigogine and Nicolis 
suggested that energy flowing into primitive living systems might have played a role in 



the origin of biological organization. They characterized living organisms as open 
systems that maintain their particular form of organization by utilizing large quantities of 
energy and matter from the environment (and by “dissipating” large quantities of energy 
and matter into the environment). 1 An open system is one that interacts with the 
environment and whose behavior or structure is altered by that interaction. Prigogine 
demonstrated that open systems driven far from equilibrium (i.e., driven far from the 
nonnal state they would occupy in the absence of the environmental input) often display 
self-ordering tendencies as they receive an input of energy. For example, thermal energy 
flowing through a heat sink will generate distinctive convection currents or “spiral wave 
activity.” 

In their book, Prigogine and Nicolis suggested that the organized structures observed in 
living systems might have similarly “self-originated” with the aid of an energy source. 
They conceded the improbability of simple building blocks arranging themselves into 
highly ordered structures under normal equilibrium conditions. Indeed, Prigogine 
previously had characterized the probability of living systems arising by chance alone as 
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“vanishingly small.” But now he and Nicolis suggested that under nonequilibrium 
conditions, where an external source of energy is supplied, biochemical systems might 
arrange themselves into highly ordered patterns and primitive biological structures. 



Order Versus Information 

When I first learned about Prigogine and Nicolis’s theory and the analogies by which 
they justified it, it did seem plausible. But as I considered the merits of their proposal, I 
discovered that it had an obvious defect, one that the prominent information theorist 
Hubert Yockey described to me in an interview in 1986. Yockey pointed out that 
Prigogine and Nicolis invoked external self-organizational forces to explain the origin of 
order in living systems. But, as Yockey noted, what needs explaining in biological 
systems is not order (in the sense of a symmetrical or repeating pattern), but information, 
the kind of specified digital information found in software, written languages, and DNA. 

Energy flowing through a system may produce highly ordered patterns. Strong winds 
form swirling tornados and the “eyes” of hurricanes; Prigogine’s thermal baths develop 
interesting convection currents; and chemical elements coalesce to form crystals. But 
Yockey insisted that this kind of symmetric order has little to do with the specified 
complexity or information in DNA, RNA, and proteins. To say otherwise conflates two 
distinct types of patterns or sequences. 

As was my habit, I developed a visual illustration to convey this point to my college 
students. Actually, I borrowed the visual aid from the children of a professor friend who 
lived in my neighborhood. The homemade toy his children played with was meant to 
entertain, but I realized that it perfectly illustrated Yockey ’s distinction between order 
and specified complexity and his critique of Prigogine’s self-organizational model. 




The toy was made of two one-liter soda bottles that were sealed and fastened together at 
each opening by a red plastic coupling. The two bottles together made one large 
hourglass shape. The device also contained a turquoise liquid (probably water with food 
coloring) and some silver flecks that would sparkle as the liquid swirled around. Liquid 
from one bottle could flow into the other bottle. The children liked to hold the bottles 
upright until all the liquid from the top bottle flowed into the bottom one. Then they 
would quickly turn the whole apparatus over and give it a sudden shake by the narrow 
neck. Next they would watch as the blue liquid would organize into a swirling vortex in 
the top bottle and begin to drain into the bottom bottle. 

After convincing the children to lend me their toy, I used it in class to illustrate how an 
infusion of energy could spontaneously induce order in a system. This was an important 
point to establish with my students, because some of them had heard creationist 
arguments about how the second law of thermodynamics dictates that order in nature 
always dissipates into disorder over time. Prigogine had shown that, although disorder 
will ultimately increase over time in a closed system such as our whole universe, order 
may arise from disorder spontaneously when energy enters into smaller (than the 
universe) open systems. I used the big blue vortex maker to demonstrate that order can, 
indeed, arise from an infusion of energy (in this case, a sudden shake and flipping of the 
apparatus) into an open system. 

Nevertheless, I also wanted my students to understand that there was a difference 
between order and specified complexity and why that distinction called into question the 
ultimate relevance of Prigogine’s ideas. To illustrate this, I would ask them to focus on 
the individual flecks sparkling within the swirling blue liquid. Could they see any 
interesting arrangements of these flecks that performed a communication function? Did 
the flecks spell any messages or encode any digital infonnation? Obviously, the answer 
was no. Students could see a highly random arrangement of sparkling flecks. They also 
could see an orderly pattern in the motion of the liquid as a whole as the swirling blue 
water formed the familiar funnel shape of a vortex. Nevertheless, they could not detect 
any specified or functional information, no interesting patterns forming sequences of 
alphabetic or digital characters. 

My students had no trouble comprehending the point of my somewhat crude illustration. 
Energy flowing through an open system will readily produce order. But it does not 
produce much specified complexity or information. 

The astrophysicist Fred Hoyle had a similar way of making the same point. He famously 
compared the problem of getting life to arise spontaneously from its constituent parts to 
the problem of getting a 747 airplane to come together from a tornado swirling through a 
junk yard. An undifferentiated external force is simply too blunt an instrument to 
accomplish such a task. Energy might scatter parts around randomly. Energy might 
sweep parts into an orderly structure such as a vortex or funnel cloud. But energy alone 
will not assemble a group of parts into a highly differentiated or functionally specified 




system such an airplane or cell (or into the informational sequences necessary to build 
one). 



Kenyon’s self-organizational model had already encountered this problem. He came to 
realize that, although internal chemical affinities might produce highly repetitive or 
ordered sequences, they certainly did not produce the information-rich sequences in 
DNA. Now a similar problem reemerged as scientists considered whether lawlike 
external forces could have produced the information in DNA. Prigogine’s work showed 
that energy in an open system can create patterns of symmetrical order. But it provided 
no evidence that energy alone can encode functionally specified infonnation-rich 
sequences — whether biochemical or otherwise. Self-organizational processes explain 
well what doesn’t need explaining in life. 

It’s actually hard to imagine how such self-organizing forces could generate or explain 
the specificity of arrangement that characterizes infonnation-rich living systems. In my 
vortex maker, an externally induced force infused energy through the system, sweeping 
all the constituents of the system along basically the same path. In Prigogine’s convection 
baths, an energy source established a pattern of motion throughout the system that 
affected all the molecules in a similar way, rather than arranging them individually and 
specifically to accomplish a function or convey a message. Yet character-by-character 
variability and specificity of arrangement are hallmarks of functional infonnation-rich 
sequences. Thus, as Yockey notes: “Attempts to relate the idea of order... with biological 
organization or specificity must be regarded as a play on words that cannot stand careful 
scrutiny. Informational macromolecules can code genetic messages and therefore can 
carry infonnation because the sequence of bases or residues is affected very little, if at 
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all, by [self-organizing] physicochemical factors.” 



The Limits of the Algorithm 

As a result of these difficulties, few, if any, scientists now maintain that Prigogine and 
Nicolis solved the problem of the origin of biological information. Nevertheless, some 
scientists continued to hope that further research would identify a specific 
self-organizational process capable of producing biological infonnation. For example, 
biophysicist Manfred Eigen suggested in 1992 that “Our task is to find an algorithm, a 
natural law that leads to the origin of information.” 4 

This sounded good, but I began to wonder whether any lawlike process could produce 
information. Laws, by definition, describe events that repeatedly and predictably recur 
under the same conditions. One version of the law of gravity states that “all unsuspended 
bodies will fall.” If I lift a ball above the earth and let it go, it will fall. Every time. 
Repeatedly. Another law states that “water heated to 212 degrees Fahrenheit at sea level 
will boil and produce steam.” Apply heat to a pan of water. Watch and wait. Bubbles and 
steam will appear. Predictably. Laws describe highly predictable and regular conjunctions 
of events — repetitive patterns, redundant order. They do not describe the kind of 
complexity necessary to convey information. 




Here’s another way to think of it. Scientific laws often describe predictable relationships 
between antecedent conditions and consequent events. Many scientific laws take the 
form, “If A occurs, then B will follow, given conditions C.” If the conditions C are 
present, and an event of type A occurs, then an event of type B will follow, predictably 
and “of necessity.” Thus, scientific laws describe patterns in which the probability of 
each successive event (given the previous event) approaches one, meaning the 
consequent must happen if the antecedents are present. Yet, as noted previously, events 
that occur predictably and “of necessity” do not convey infonnation. Instead, information 
arises in a context of contingency. Information mounts as improbabilities multiply. Thus, 
to say that scientific laws generate complex infonnational patterns is essentially a 
contradiction in terms. If a process is orderly enough to be described by a law, it does not, 
by definition, produce events complex enough to convey information. 

Of course, lawlike processes might transmit information that already exists in some other 
form, but such processes do not generate specified information. To see why, imagine that 
a group of small radio-controlled helicopters hovers in tight formation over a football 
stadium, the Rose Bowl in Pasadena, California. From below, the helicopters appear to be 
spelling a message: “Go USC.” At halftime with the field cleared, each helicopter 
releases either a red or gold paint ball, one of the two University of Southern California 
colors. The law of gravity takes over and the paint balls fall to the earth, splattering paint 
on the field after they hit the turf. Now on the field below, a somewhat messier but still 
legible message appears: “Go USC.” 

Did the law of gravity, or the force described by the law, produce this infonnation? 
Clearly, it did not. The infonnation that appeared on the field already existed in the 
anangement of the helicopters above the stadium — in what physicists call the “initial 
conditions.” Neither the force of gravity nor the law that describes it caused the 
information on the field to self-organize. Instead, gravitational forces merely transmitted 
preexisting information from the helicopter fonnation — the initial conditions — to the 
field below. 

Sometimes when I’ve used these illustrations I’ve been asked: “But couldn’t we discover 
a very particular configuration of initial conditions that generates biological infonnation? 
If we can’t hope to find a law that produces information, isn’t it still possible to find a 
very particular set of initial conditions that generates infonnation in a predictable law-like 
way?” But this objection just restates the basic self-organizational proposal in new words. 
It also again begs the question of the ultimate origin of information, since “a very 
particular set of initial conditions” sounds precisely like an information-rich — a highly 
complex and specified — state. As I would later discover, however, this wasn’t the only 
proposal to beg the question about the ultimate origin of infonnation. Instead, attempts to 
explain the origin of infonnation by reference to some prior set of conditions invariably 
shifted — or displaced — the problem someplace else. My first inkling of this problem 
came as I reflected on perhaps the most innovative — “outside the 

box” — self-organizational proposal of all. 




The Kauffman Model 



If neither internal bonding affinities between the constituents of DNA nor ubiquitous 
external forces acting upon those constituents can account for the specific sequence of 
the DNA bases, then what was left? It didn’t seem to me that there could be much left, 
since “forces external” and “forces internal” to the molecule seemed to exhaust the set of 
possibilities. Yet, even so, I knew that another player was about to step onto the stage, 
Stuart Kauffman. 

Kauffman is a brilliant scientist who trained as a physician at the University of 
California, San Francisco, and later worked as professor of biochemistry and biophysics 
at the University of Pennsylvania before leaving to help head up the Santa Fe Institute, a 
research institute dedicated to the study of complex systems. During the early 1990s, as I 
was beginning to examine the claims of self-organizational theories, I learned that 
Kauffman was planning to publish a treatise advancing a new self-organizational 
approach. His new book promised to bring the progress made at Santa Fe to bear on the 
problem of the origin of life and indeed to make significant steps toward solving that 
problem within a self-organizational framework. His long-anticipated book was titled The 
Origins of Order: Self-Organization and Selection in Evolution. 

I remember the day Kauffman’s book finally arrived at the college where I taught. I 
quickly opened the package in my office only to discover a rather imposing 
seven-hundred-page behemoth of a book. After I scanned the table of contents and read 
the opening chapters, it became clear that much of Kauffman’s book provided a rather 
generalized discussion of the mathematical properties of complex systems. His specific 
proposal for explaining the origin of life occupied less than eighty pages of the book and 
was mainly confined to one chapter. My curiosity got the better of me. I decided to read 
this section first and return to the rest of the book later. 





Figure 12.1. Dr. Kauffman’s primary work has been as a theoretical biologist studying 
the origin of life and molecular organization. Courtesy of the University of Calgary. 



Kauffman had, in keeping with his reputation as an “outside the box” thinker, made a 
bold and innovative proposal for explaining the origin of life. Rather than invoking either 
external lawlike forces or internal bonding affinities to explain how biological 
information had self-organized, his hypothesis sought to transcend the problem 
altogether. He proposed a self-organizational process that could bypass the need to 
generate genetic information. 

Kauffman attempted to leapfrog the “specificity” (or information) problem by proposing 
a means by which a self-reproducing metabolic system might emerge directly from a set 
of “low-specificity” catalytic peptides and RNA molecules in a prebiotic soup, or what he 
called a “chemical minestrone.” 5 A metabolic system is a system of molecules that react 
with each other inside a living cell in order to sustain a vital function. Some metabolic 
systems break down molecules to release energy; others build molecules to store energy 
(as occurs during ATP synthesis) or information (as occurs in DNA replication or protein 
synthesis). In extant forms of life, these reactions usually involve or are mediated by a 
number of highly specific enzyme catalysts and other proteins. As such, Kauffman’s 
proposal represents a kind of protein-first theory similar in some respects to Kenyon’s 
earlier model. 

Nevertheless, Kauffman suggests, unlike Kenyon, that the first metabolic system might 
have arisen directly from a group of low- specificity polypeptides. He proposes that once a 
sufficiently diverse set of catalytic molecules had assembled (in which the different 
peptides performed enough different catalytic functions, albeit inefficiently), the 
ensemble of individual molecules spontaneously underwent a kind of phase transition 
(akin to crystallization) resulting in a self-reproducing metabolic system. Kauffman 
envisions, as the historian of biology Iris Fry puts it, “a set of catalytic polymers in which 
no single molecule reproduces itself but the system as a whole does.” 6 In this way, 
Kauffman argues that metabolism (and the proteins necessary to it) could have arisen 
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directly without genetic information encoded in DNA. 

Kauffman’s model was clearly innovative and arguably more sophisticated than many 
previous self-organizational models. Unlike previous models, Kauffman’s claims that an 
ensemble of relatively short and “low-specificity” catalytic peptides and RNA molecules 
would together be enough to establish a metabolic system. He defends the biochemical 
plausibility of his scenario on the grounds that some proteins can perform enzymatic 
functions with low specificity and complexity. To support his claim, he cites a class of 
proteins known as proteases (including one in particular, called trypsin) that cleave 
peptide bonds at single amino-acid sites. But is he right? 

As I thought more about Kauffman’s proposal and researched the properties of proteases, 
I became convinced that his proposal did not solve or successfully bypass the problem of 




the origin of biological infonnation. Kauffman himself acknowledges that, as yet, there is 
no experimental evidence showing that such autocatalysis could occur. But, beyond that, 
I realized that Kauffman had either presupposed the existence of unexplained sequence 
specificity or transferred the need for specified information out of view. In fact, 
Kauffman’s model has at least three significant infonnation-related problems. 

First, it does not follow, nor is it the case biochemically, that just because some enzymes 
might function with low specificity, that all the catalytic peptides (or enzymes) needed to 
establish a self-reproducing metabolic cycle could function with similarly low levels of 
specificity and complexity. Instead, modern biochemistry shows that most and usually all 
of the molecules in a closed interdependent metabolic system of the type that Kauffman 
envisions require high-complexity and-specificity proteins. Enzymatic catalysis (which 
his scenario would surely require) needs molecules long enough to form tertiary 
structures. Tertiary structures are the three-dimensional shapes of proteins, which provide 
the spatial positioning of critical amino-acid residues to convey particular or specialized 
functions. 9 How long a protein chain needs to be to form one of those shapes depends on 
its complexity. For the very simplest shapes, something like forty or fifty amino acids are 
needed. More complicated shapes may require several hundred amino acids. Further, 
these long polymers require very specific three-dimensional geometries (which in turn 
derive from sequence-specific arrangements of monomers) in order to catalyze necessary 
reactions. How do these molecules acquire their specificity of sequencing? Kauffman 
does not address this question, because his model incorrectly suggests that he does not 
need to do so. 

Second, I discovered that even the allegedly low-specificity molecules (the proteases) 
that Kauffman cites to illustrate the plausibility of his scenario are actually very complex 
and highly specific in their sequencing. I also discovered that Kauffman confuses the 
specificity and complexity of the parts of the polypeptides upon which the proteases act 
with the specificity and complexity of the proteins (the proteases) that do the enzymatic 
acting. Though trypsin, for example, acts upon — cleaves — peptide bonds at a relatively 
simple target (the carboxyl end of two separate amino acids, arginine and lysine), trypsin 
itself is a highly complex and specifically sequenced molecule. Indeed, trypsin is a 
non-repeating 247-amino-acid protein that possesses significant sequence specificity as a 
condition of function. 10 

Further, trypsin has to manifest significant three-dimensional (geometric) specificity in 
order to recognize the specific amino acids arginine and lysine, at which sites it cleaves 
peptide bonds. By equivocating in his discussion of specificity, Kauffman obscures from 
view the considerable specificity and complexity requirements of the proteases he cites to 
justify his claim that low-specificity catalytic peptides will suffice to establish a 
metabolic cycle. Thus, Kauffman’s own illustration, properly understood (i.e., without 
equivocating about the relevant locus of specificity), shows that for his scenario to have 
biochemical plausibility, it must presuppose the existence of many high-complexity 
and-specificity polypeptides and polynucleotides. Where does this information in these 
molecules come from? Kauffman again does not say. 




Third, Kauffman acknowledges that for autocatalysis to occur, the molecules in the 
chemical minestrone must be held in a very specific spatial-temporal relationship to one 
another. 1 1 In other words, for the direct autocatalysis of integrated metabolic complexity 
to occur, a system of catalytic peptide molecules must first achieve a very specific 
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molecular configuration (or what chemists call a “low-configurational entropy state”). 
This requirement is equivalent to saying that the system must start with a large amount of 
specified information or specified complexity. By Shannon’s theory of information, 
information is conveyed every time one possibility is actualized and others are excluded. 
By admitting that the autocatalysis of a metabolic system requires a specific arrangement 
of polypeptides (only one or a few of the possible molecular arrangements, rather than 
any one of the many possibilities) Kauffman tacitly concedes that such an arrangement 
has a high-information content. Thus, to explain the origin of specified biological 
complexity at the systems level, Kauffman has to presuppose a highly specific 
arrangement of those molecules at the molecular level as well as the existence of many 
highly specific and complex protein and RNA molecules (see above). In short, Kauffman 
merely transfers the information problem from the molecules into the soup. 

In addition to these problems, Kauffman’s model encounters some of the same problems 
that Kenyon’s protein- first model and other metabolism-first models encounter. It does 
not explain (a) how the proteins in various metabolic pathways came into association 
with DNA and RNA or any other molecular replicator or (b) how the information in the 
metabolic system of proteins was transferred from the proteins into the DNA or RNA. 
And it gives no account of (c) how the sequence specificity of functional polypeptides 
arose (given that the bonding affinities that exist among amino acids don’t correlate to 
actual amino-acid sequences in known proteins). 

Robert Shapiro, a leading chemist at New York University, has recently proposed that 
origin-of-life researchers begin to investigate metabolism-first models of the kind that 
Kauffman proposed. Shapiro argues that these models have several advantages that other 
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popular origin-of-life scenarios (particularly RNA-first models, see Chapter 14) don’t. 
Though Shapiro favors these metabolism-first approaches, he acknowledges that 
researchers have not yet identified what he calls a “driver reaction” that can convert small 
molecules into products that increase or “mobilize” the organization of the system as a 
whole. He also notes that researchers on metabolism- first models “have not yet 
demonstrated the operation of a complete [metabolic] cycle or its ability to sustain itself 
and undergo further evolution.” 14 In short, these approaches remain speculative and do 
not yet offer a way to solve the fundamental problem of the origin of biologically 
relevant organization (or information). 

In any case, I concluded that Kauffman’s self-organization model — to the extent it had 
relevance to the behavior of actual molecules — presupposes or transfers, rather than 
explains, the ultimate origin of the specified information necessary to the origin of a 
self-reproducing metabolic cycle. I wasn’t the only one to find Kauffman’s 
self-organizational model insufficient. Other scientists and origin-of-life researchers 
made similar criticisms. 15 Though many origin-of-life researchers have expressed their 




admiration for Kauffman’s innovative new approach, few, if any, think that his model 
actually solves the problem of the origin of infonnation or the origin of life. Perhaps, for 
this reason, after 1993 Kauffman proposed some new self-organizational models for the 
origin of biological organization. His subsequent proposals lacked the biological 
specificity of his bold, if ill-fated, original proposal. Nevertheless, Kauffman’s later 
models did illustrate just how difficult it is to explain the origin of information without 
presupposing other preexisting sources of information. 



Buttons and Strings 

In 1995 Kauffman published another book, At Home in the Universe, in which he 
attempted to illustrate how self-organizational processes might work using various 
mechanical or electrical systems, some of which could be simulated in a computer 
environment. 16 In one, he conceives a system of buttons connected by strings. The 
buttons represent novel genes or gene products, and the strings represent the lawlike 
forces of interaction between the gene products, namely, the proteins. Kauffman suggests 
that when the complexity of the system (as represented by the number of buttons and 
strings) reaches a critical threshold, new modes of organization can arise in the system 
“for free” — that is, without intelligent guidance — after the manner of a phase transition in 
chemistry, such as water turning to ice or the emergence of superconductivity in some 
metals when cooled below a certain temperature. 

Another Kauffman model involves a system of interconnected lights. Each light can flash 
in a variety of states — on, off, twinkling, and so forth. Since there is more than one 
possible state for each light and many lights, there are many possible states the system 
can adopt. Further, in his system, rules detennine how past states will influence future 
states. Kauffman asserts that, as a result of these rules, the system will, if properly tuned, 
eventually produce a kind of order in which a few basic patterns of light activity recur 
with greater than random frequency. Since these patterns represent a small portion of the 
total number of possible states for the system, Kauffman suggests that self-organizational 
laws might similarly produce a set of highly improbable biological outcomes within a 
much larger space of possibilities. 

But do these simulations accurately model the origin of biological information? It’s hard 
to think so. Kauffman’s model systems are not constrained by functional considerations 
and, thus, are not analogous to biological systems. A system of interconnected lights 
governed by preprogrammed rules may well settle into a small number of patterns within 
a much larger space of possibilities. But since these patterns need not meet any functional 
requirements, they fail at a fundamental level to model biological organisms. A system of 
lights flashing “Warning: Mudslide Ahead” would model a biologically relevant 
self-organizational process, at least if such a message arose without agents previously 
programming the system with equivalent amounts of functional information. Kauffman’s 
arrangements of flashing lights are not of this sort. They serve no function, and certainly 
no function comparable to the infonnation-rich molecules found in the biological realm. 




Kauffman’s model systems differ from biological information in another striking way. 
The series of information-bearing symbols we find in the protein-coding regions of DNA, 
in sophisticated software programs, or in the sentences on this page are aperiodic. The 
sequences of characters do not repeat in a rigid or monotonous way. Kauffman’s model, 
in contrast, is characterized by large amounts of symmetrical order or internal 
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redundancy interspersed with aperiodic sequences (mere complexity) lacking function. 
Getting a law-govemed system to generate repetitive patterns of flashing lights, even 
with a certain amount of variation, is clearly interesting, but not biologically relevant. 
Since Kauffman’s models do not produce functional structures marked by specified 
aperiodic symbol sequences such as we find in DNA, they do not serve as promising 
models for explaining the origin of biological information. 

But there is another fundamental problem with Kauffman’s model systems, one he tacitly 
acknowledges. To the extent that Kauffman’s systems do succeed in producing 
interesting nonrandom patterns, they do so only because of an unexplained intervention 
of information. For example, Kauffman notes that if his system of flashing lights is 
properly “tuned ,” 18 then it will shift from a chaotic regime to an orderly regime that will 
produce the outcomes or patterns he regards as analogous to processes or structures 
within living systems. By “tuning,” Kauffman means the careful setting of a particular 
“bias parameter” to make his system shift from a chaotic regime into one in which order 
is produced. In other words, the tuning of this parameter ensures that certain kinds of 
outcomes are actualized and others precluded. Such an act constitutes nothing less than 
an infusion of information. When someone tunes a radio dial or a musical instrument, he 
or she selects a certain frequency and excludes many others. Yet Shannon defined an 
informative intervention as precisely electing one option and excluding others. 

In his system of flashing lights, Kauffman briefly mentions that two of his 
collaborators — physicists Bernard Derrida and Gerard Weisbuch — were responsible for 
the “tuning” that produced the patterns he thinks analogous to order in living systems. 
Nevertheless, he does not think that any such agency played a role in the origin of life. 
Thus, even assuming for the sake of argument that his system of flashing lights manifests 
features analogous to those in living systems, his model still falls prey to what Dembski 
calls the “displacement problem” — the problem of explaining the origin of one 
(putatively) information-rich system only by introducing another unexplained source of 
information. 



Conclusion: The Displacement Problem 

As I examined Kauffman’s model, it occurred to me that I was beginning to see a pattern. 
Self-organizational models for the origin of biological organization were becoming 
increasingly abstract and disconnected from biological reality. Model systems such as 
Prigogine’s or Kauffman’s did not even claim to identify actual chemical processes that 
led in a life-friendly direction. Instead, these models claimed to describe processes that 
produced phenomena with some limited similarity to the organization found in living 




systems. Yet upon closer inspection these allegedly analogous phenomena actually 
lacked important similarities to life, in particular, the presence of specified complexity, or 
information. 

But beyond that, I realized that self-organizational models either failed to solve the 
problem of the origin of specified information, or they “solved” the problem only at the 
expense of introducing other unexplained sources of infonnation. Kauffman’s models 
provided only the best illustration of this latter “displacement problem.” In addition, the 
earlier models such as Kenyon’s or Prigogine’s, relying as they did on processes that 
produced order rather than complexity, each fell prey to both empirical and conceptual 
difficulties. They not only failed to explain the origin of information; they did so in a 
highly instructive way — one that helped to clarify our understanding of the nature of 
information and why it stands conceptually distinct from redundant order and the lawful 
processes that produce such order. 

Thus, despite the cachet associated with self-organizational theories as the “new wave” of 
thinking in evolutionary biology, I came to reject them as complete nonstarters, as 
theories that were unlikely ever to succeed regardless of the outcome of future empirical 
studies. In my view, these models either begged the question or invoked a logical 
contradiction. Proposals that merely transfer the infonnation problem elsewhere 
necessarily fail because they assume the existence of the very entity — specified 
information — they are trying to explain. And new laws will never explain the origin of 
information, because the processes that laws describe necessarily lack the complexity that 
informative sequences require. To say otherwise betrays confusion about the nature of 
scientific laws, the nature of information, or both. 

As I reflected on the failure of these models, my interest in the design hypothesis 
increased. But the reason for this was not just that self-organizational scenarios had 
failed. Instead, it was that self-organizational theories failed in a way that exposed the 
need for an intelligent cause to explain the relevant phenomena. Remember my magnetic 
chalkboard demonstration? I had used that demonstration to show that chemical forces of 
attraction don’t explain the specific arrangements of bases in DNA any more than 
magnetic forces of attraction explained the arrangement of letters on my letter board. But 
what did explain the arrangement of magnetic letters on that board? Obviously, an 
intelligent agent. Might the arrangement of bases in DNA have required such a cause as 
well? 

Recall Kauffman’s model systems. In each case, they explained the origin of information 
by reference to an unexplained source of information. These scenarios too lacked just 
what the design hypothesis provided: a cause known to be capable of generating 
information in the first place. In one case, Kauffman presupposes that his system will 
work only once it had been “tuned.” But how? Was intelligence necessary to do what 
self-organizational processes alone could not? 




A similar thought had occurred to me earlier when reflecting on chance elimination. 
Many events that we would not credit to chance — in particular, highly improbable events 
that matched independent patterns — were actually best explained by intelligent design. 
The improbable match between the two college papers that led my colleagues and me to 
exclude the chance hypothesis also led us to conclude plagiarism, a kind of design. 

Even Christian de Duve, in explaining why the origin of life could not have occurred by 

chance, acknowledged (if inadvertently) that the kind of events that lead us to reject 

chance also suggest design. Recall that de Duve pointed out that a “string of improbable 

events” such as someone winning the lottery twice in a row (a kind of pattern match) “ do 

not happen naturally .” 19 Of course, de Duve went on to state that the failure of the 

chance hypothesis implied, for him, that life must be “an obligatory manifestation of 
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matter” — one that had self-organized when the correct conditions arose. 

Having examined the leading self-organizational theories in detail, I now doubted this. I 
also later learned from de Duve himself that he felt compelled to elect a 
self-organizational model, because he was unwilling to consider design as an alternative 
to chance. He thought invoking design violated the rules of science. As he explains: 
“Cells are so obviously programmed to develop according to certain lines... that the word 
design almost unavoidably comes to mind, . . . [but] life is increasingly explained strictly in 
tenns of the laws of physics and chemistry. Its origin must be accounted for in similar 
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tenns.” As I explain later (see Chapters 18 and 19), I saw no reason to accept this 
prohibition against considering the design hypothesis. To me, it seemed like an 
unnecessary restriction on rational thought. So the failure of chance and 
self-organizational models — as well as the way these models failed — only made me more 
open to intelligent design as a possible hypothesis. 

Indeed, the design hypothesis now seemed more plausible to me than when I first 
encountered it and certainly more plausible than it had before I had investigated the two 
most prominent alternative categories of explanation, chance and necessity. But I knew 
that there was another category of explanation that I needed to investigate more fully, one 
that combined chance and necessity. Some of the most creative proposals for explaining 
the origin of biological information relied on the interplay of lawlike processes of 
necessity with the randomizing effects of “chance” variations. So I needed to examine 
this class of theories as well. The next two chapters describe what I discovered about 
them. 
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Chance and Necessity, or the Cat in the Hat Comes Back 



In the Dr. Seuss children’s book The Cat in the Hat Comes Back, a tall bipedal cat makes 
a return visit to the home of two children, Sally and her unnamed little brother. On the 
Cat’s first visit, he made a colossal mess, so the children are less than delighted to see 
him a second time. As before, the Cat makes himself right at home, this time by eating a 
slice of pink cake in the bathtub and leaving behind a pink ring. Ever helpful, the Cat 
immediately begins cleaning up his own mess. Unfortunately, every attempt he makes to 
remove the pink ring from the bathtub only results in spreading parts of the pink ring to 
various other household objects. Next the Cat pulls a series of smaller cats from his 
enormous hat to lend a hand, but none of them succeeds in solving the problem. Instead, 
now the pink stuff is everywhere. Finally the last and littlest cat produces a mysterious 
device called Voom, which suddenly and dramatically cleans up every shred of pink. 

What does this story have to do with the origin of biological information? As I examined 
more and more theories about the origin of life, I discovered a pattern, one curiously 
reminiscent of Dr. Seuss’s Cat and his indissoluble pink icing. Origin-of-life researchers 
have been looking for their “Voom” for over fifty years, a process that can, once and for 
all, clean up the problem of explaining the origin of information. I discovered that every 
attempt to explain the origin of biological information either failed or transferred the 
problem elsewhere — either by presupposing some other unexplained sources of 
information or by overlooking the indispensable role of an intelligence in the generation 
of the information in question. I first noticed this pattern as I was examining 
self-organizational theories. As I began to examine theories that invoked both chance and 
necessity in combination, I noticed this same pattern emerging in spades. 



The Classical Oparin Model: Chance and Necessity 

The strategy of combining chance with necessity to explain the origin of life began with 
Aleksandr Oparin, who modeled his approach on Darwin’s. Just as Darwin sought to 
explain the origin of new biological forms by reference to the interplay of random 
variations and the lawlike process of natural selection, Oparin sought to explain the origin 
of the first life by combining chance and various types of necessity, including both 
deterministic chemical reactions and a kind of prebiotic natural selection. 

In his original model, Oparin asserted that a series of chemical reactions between smaller 
molecules produced the first amino acids and other building blocks (see Chapter 2). 1 
These reactions took place almost exclusively by chemical necessity. Next Oparin 
envisioned a series of chance interactions between chemical building blocks, such as 



amino acids and sugars, eventually resulting in the first proteins and other complex 
polymers, such as carbohydrates, that are needed for establishing metabolic processes. 
After these molecules arose, and after some were enclosed in coacervate bodies, 
competition for nutrients ensued. Then “necessity” asserted itself again. The coacervate 
protocells containing the most complex metabolic processes and means of absorbing 
nutrients would have grown fastest and multiplied most often. As the availability of 
nutrient substances diminished, the more numerous, highly organized protocells would 
have overwhelmed simpler structures in the “struggle for existence.” 4 Oparin envisioned 
natural selection acting on random changes in the metabolic processes of coacervate 
bodies. More metabolically complex and efficient coacervates would generally 
outcompete the less complex ones. In this way, the interplay between chance variation 
and the imperatives of competition (necessity) gradually produced coacervates containing 
more complex metabolic processes and, eventually, a living cell with the features we see 
today. 

Oparin Revised: The Natural Selection of Unspecified Polymers 

The displacement problem was not evident in Oparin’s original scenario. But Oparin’s 
original theory did not explain the origin of the information in DNA. When Oparin 
published his theory in the 1920s and 1930s, he did not yet know about the infonnation 
encoded in DNA. He therefore, understandably, made no attempt to explain its origin. 

By the late 1950s, developments in molecular biology began to cast doubt on Oparin’s 
original scenario. Not only did his model give no explanation for the emerging DNA 
enigma; his scenario relied heavily on chance to explain the initial formation of the 
proteins that made cellular metabolism possible. The discovery of the extreme 
complexity and specificity of protein molecules undermined the plausibility of this 
proposal for the reasons discussed in Chapters 9 and 10. Protein function depends upon 
hundreds of specifically sequenced amino acids, and the odds of a single functional 
protein arising by chance alone are prohibitively low, given the probabilistic resources of 
the entire universe. 

As the complexity of DNA and proteins became apparent, Oparin published a revised 
version of his theory in 1968 that envisioned a role for natural selection earlier in the 
process of abiogenesis. The new version of his theory claimed that natural selection acted 
on unspecified polymers as they formed and changed within his coacervate protocells. 5 
Instead of natural selection acting on fully functional proteins in order to maximize the 
effectiveness of primitive metabolic processes at work within the protocells, Oparin 
proposed that natural selection might work on less than fully functional polypeptides, 
which would gradually cause them to increase their specificity and function, eventually 
making metabolism possible. He envisioned natural selection acting on “primitive 
proteins” rather than on primitive metabolic processes in which fully functional proteins 
had already arisen. By claiming that the first polymers need not have been highly 
sequence-specific to be preserved by natural selection, Oparin attempted to circumvent, 
at least partially, the information problem. 




I say “partially,” because Oparin’s revised scenario also attempted to explain the origin of 
genetic infonnation in DNA. It proposed that both unspecified polypeptides and 
unspecified polynucleotides arose by chance and were later enclosed within coacervates. 
Natural selection then began to act on both types of molecules by favoring the 
coacervates that gradually developed functionally specified proteins and genes over those 
that did not. 6 As the specificity of the DNA and protein molecules within the coacervates 
increased, they came into association with each other, gradually producing metabolic 
processes and mechanisms of infonnation storage and transmission. Thus Oparin sought 
to take into account the importance and apparent primacy of the genetic material to living 
systems. In doing so, he also sought to counter criticisms from advocates of a competing 
theory of abiogenesis known as the “gene theory.” 

Gene theorists such as Hennann Muller believed that life had evolved first in the form of 
a single (“living”) genetic molecule as the result of a fortuitous combination of 
chemicals. To such theorists, the primacy of DNA in information storage and its 
importance in initiating the process of protein synthesis suggested an inadequacy in 
Oparin’s original theory. Muller attacked Oparin’s view that life evolved first in a 
metabolizing multimolecular system (i.e., in coacervates) “on the grounds that while 
changes in genes lead to changes in metabolism, the reverse was known not to be the 
case.” 8 Oparin’s silence about a specifically genetic molecule in his 1936 work did leave 
his scenario vulnerable to the criticism that it was inadequate to explain the DNA enigma. 
But during the 1960s, Oparin adapted his theory to accommodate new information about 
the function and structure of DNA and the mechanisms of self-replication and protein 
synthesis. 9 This revision enabled it to weather this initial challenge and to remain in 
currency for another decade or so. 10 

Oparin’s revised theory nevertheless encountered almost immediate criticism. First, many 
scientists recognized that Oparin’s concept of prebiotic natural selection begged the 
question. Natural selection occurs only in organisms capable of reproducing or 
replicating themselves. Yet, in all extant cells, self-replication depends on functional and, 
therefore, sequence-specific DNA and protein molecules. As theoretical biologist 
Howard Pattee explains, “There is no evidence that hereditary evolution [natural 
selection] occurs except in cells which already have... the DNA, the replicating and 
translating enzymes, and all the control systems and structures necessary to reproduce 
themselves.” 11 But this fact of molecular biology posed an obvious difficulty for 
Oparin’s theory of prebiotic natural selection. In order to explain the origin of specified 
information in DNA, Oparin invoked a process that depends upon preexisting 
sequence-specific (i.e., information-rich) DNA molecules. Yet, the origin of these 
molecules is precisely what his theory needed to explain. As Christian de Duve explains, 
theories of prebiotic natural selection necessarily fail because they “need information 
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which implies they have to presuppose what is to be explained in the first place.” 

I tell a story in lectures to illustrate how the prebiotic natural selection hypothesis begs 
the question. My story is about a man who has fallen into a thirty-foot pit. To climb out 
of the pit, the man knows he needs a ladder. Unmoved by the severity of his difficulty, 




the man goes home and gets a ladder out of his garage, walks back to the pit, climbs back 

in, sets up the ladder, and climbs out of the hole. The ladder from his garage was just the 

ticket, but my story begs an obvious question: How did the man get out of the pit in the 

first place in order to fetch the ladder? In the same way, the concept of prebiotic natural 

selection begs the question of how nature generated the sequence-specific 

information-rich DNA and proteins that are needed to make self-replication, and thus 

natural selection, possible. Indeed, for this reason, Theodosius Dobzhansky, one of the 

leading evolutionary biologists of the twentieth century, insisted in 1968 that 
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“Prebiological natural selection is a contradiction in terms.” 

In fairness, Oparin did attempt to circumvent this problem. He proposed that prebiotic 
natural selection initially would act on unspecified strings of nucleotides and amino 
acids. But this created another problem for his scenario. Researchers pointed out that any 
system of molecules for copying infonnation would be subject to a phenomenon known 
as “error catastrophe” unless those molecules are specified enough to ensure an error-free 
transmission of information. An error catastrophe occurs when small errors — deviations 
from functionally necessary sequences — are amplified in successive replications. 14 Since 
the evidence of molecular biology shows that unspecified polypeptides will not replicate 
genetic information accurately, Oparin’s proposed system of initially unspecified 
polymers would have been highly vulnerable to such an error catastrophe. 

Thus, the need to explain the origin of specified information created an intractable 
dilemma for Oparin. If, on the one hand, Oparin invoked natural selection early in the 
process of chemical evolution (i.e., before functional specificity in amino acids or 
nucleotide strings had arisen), accurate replication would have been impossible. But in 
the absence of such replication, differential reproduction cannot proceed and the concept 
of natural selection is incoherent. 

On the one hand, if Oparin invoked natural selection late in his scenario, he would need 
to rely on chance alone to produce the sequence-specific molecules necessary for 
accurate self-replication. But even by the late 1960s, many scientists regarded that as 
implausible given the complexity and specificity of the molecules in question. 

Oparin’s dilemma was made all the more acute by the logic of natural selection itself. 15 
As proposed by Darwin, the process of natural selection favors or “selects” functionally 
advantageous variations. Thus, it can “select” only what random variations first produce. 
Yet it is extremely unlikely that random molecular interactions would produce the 
information present in even a single functioning protein or DNA molecule of modest 
length (see Chapters 9 and 10). And the improbability of biological infonnation arising 
by chance undennines the plausibility of prebiotic natural selection because it implies an 
exceedingly high (i.e., improbable) initial threshold of selectable function. Given the 
probabilistic resources of the whole universe, it is extremely unlikely that even one 
functional protein or DNA molecule — to say nothing of the suite of such molecules 
necessary to establish natural selection — would arise by chance. Yet the hypothesis of 




prebiotic natural selection presupposes that a series of such improbable events occurred 
before natural selection played any role at all. 



The work of John von Neumann, one of the leading mathematicians of the twentieth 
century, made this dilemma more acute. In 1966, von Neumann showed that any system 
capable of self-replication would require subsystems that were functionally equivalent to 
the infonnation storage, replicating, and processing systems found in extant cells. 16 His 
calculations established an extremely high threshold of minimal biological function, a 
conclusion that was confirmed by later experimental work. On the basis of minimal 
complexity and related considerations, several scientists during the late 1960s (von 
Neumann, physicist Eugene Wigner, biophysicist Harold Morowitz) made calculations 
showing that random fluctuations of molecules were extremely unlikely to produce the 
minimal complexity needed for a primitive replication system. 18 Indeed, as we saw in 
Chapters 9 and 10, the improbability of developing a replication system vastly exceeds 
the improbability of developing the protein or DNA components of such a system. 

As a result, by the late 1960s many scientists came to regard the hypothesis of prebiotic 
natural selection as indistinguishable from the pure chance hypothesis, since random 
molecular interactions were still needed to generate the initial complement of biological 
information that would make natural selection possible. Prebiotic natural selection could 
add nothing to the process of information generation until after vast amounts of 
functionally specified infonnation had first arisen by chance. Oparin’s idea of prebiotic 
natural selection succeeded only in pushing the pink ring back into the murky realm of 
small probabilities. 



DNA First 

As I investigated various other models combining chance and necessity, I found some 
highly creative proposals. One of the most creative was devised in 1964 by Henry 
Quastler, an early pioneer in the application of information theory to molecular biology. 
Quastler proposed a DNA-first model for the origin of life in which chance processes 
create a system of unspecified polynucleotides that can self-replicate via complementary 
base pairing (by chemical necessity). Like Oparin, Quastler thought these initial 
polynucleotides would have arisen without the specificity of sequencing necessary to 
build specific proteins. 19 Quastler considered this an advantage for his model. He 
acknowledged that it was extremely improbable that chance alone would produce a 
specific sequence of nucleotide bases long enough to function as a gene. Nevertheless, he 
knew that if there were no constraints on how DNA was sequenced — if any sequence 
would do — then chance alone might do the job. 

But then how did biological specificity and functional infonnation arise? Quastler 
thought that specificity in these molecules arose later, after his system of polynucleotides 
had — by chance — come to associate with a set of proteins and ribosomes capable of 
producing proteins from the particular nucleotide sequences that happened to have arisen. 
At some point, the previously unspecified polynucleotide sequences acquired specificity 




and functional significance by their association with a system of other molecules for 

producing proteins. Thus, Quastler characterized the origin of information in 
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polynucleotides as an “accidental choice remembered.” 

Quastler developed an illustration to convey what he had in mind using a combination 
lock. He asked his readers to imagine someone blindly choosing a combination — any 
combination — at random. Quastler suggested that after this combination was selected, 
someone could set the tumblers to match it. As a result, the combination would thereafter 
open the lock. Quastler used this illustration to show how an initially unspecified 
sequence that had arisen by chance could later acquire functional specificity. 

Quastler’s scenario possessed one overriding difficulty, however. It did not account for 
the origin of the specificity of the molecular system that conferred functional significance 
and specificity on the initial sequence of nucleotides. In Quastler’s combination lock 
example, a conscious agent chose the tumbler settings that made the initial combination 
functionally significant. Further, engineers designed the lock that made it possible to 
specify the tumbler settings. Yet Quastler expressly precluded conscious design as a 
possibility for explaining the origin of life. Instead, he seemed to suggest that the origin 

of the set of functionally specific proteins (and the translation system) necessary to create 
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a “symbiotic association” between polynucleotides and proteins would arise by chance. 
He offered some rough calculations to show that a multimolecular system could have 
originated by chance in the prebiotic soup. But these calculations, which were perfonned 
in 1964, are no longer credible. They vastly underestimated the improbability of 

generating a single protein by chance alone, to say nothing of the whole gene-expression 
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system. 

Moreover, Quastler’s approach to the problem of the origin of biological information 
provided another striking example of the displacement problem. Quastler “solved” the 
problem of the origin of complex specificity in nucleic acids only by transferring the 
problem to an equally complex and specified system of proteins, RNA molecules, and 
ribosomes. In his scenario, any polynucleotide sequence would suffice at first, but the 
proteins and ribosomal material that made up the translation system that arose later would 
have to exhibit an extreme specificity relative to the initial polynucleotide sequence. 
Each of these molecules would also have to possess a specific shape and sequence in 
order to play the functional role that Quastler expected it to play in converting 
unspecified sequences of nucleotide bases into proteins. The proteins would have to 
possess the correct sequences of amino acids in order to be able to unwind and copy 
genetic information; the ribosomal proteins and RNAs would need to be sequenced 
precisely in order to fold into subunits that fit together to form a functional ribosome; the 
transfer RNAs would have to mediate specific associations in order to convert the 
random sequences of bases on the polynucleotides into specific amino-acid sequences; 
and the sequences of amino acids thus produced would have to be arranged precisely in 
order to fold into stable three-dimensional structures. Like other scenarios I had 
encountered, his attempt to bypass the sequence-specificity problem merely shifted it 
elsewhere. 




Hypercycles 



Another popular model for explaining the origin of life — one that combines chance and 
chemical necessity 24 — was proposed in the late 1970s by Manfred Eigen, a Gennan 
biophysicist who won the Nobel Prize in Chemistry in 1967. Eigen proposed a system 
called a hypercycle to explain how new biological infonnation and structure might have 
accumulated over time as life developed. A hypercycle is a hypothetical 
self-reproducing system made of many enzymes and RNA molecules. In the hypercycle, 
enzymes and RNA molecules react with each other chemically to make structural 
improvements in the system as a whole. The system develops as one group of enzymes 
increases the replication rate of a sequence-specific RNA molecule, which in turn 
increases the rate of production of other enzymes, which in turn increases the replication 
rate of a different RNA molecule, and so on. Eventually, a group of enzymes increases 
the replication rate of the original RNA in the reaction sequence, forming a cycle that 
repeats indefinitely (see Fig. 13.1). Theoretically, the RNA and protein molecules of a 
hypercycle can be organized in an interdependent way to give the hypercycle, as a whole, 
a kind of structural stability. Because Eigen proposed this as a mechanism for increasing 
the molecular information and structure of a developing protocell, some have claimed 
that he solved — at least in theory — the information problem associated with life’s origin. 

Critics of his model, such as evolutionary biologist John Maynard Smith, physicist 
Freeman Dyson, and chemist Robert Shapiro, have contested this assessment, showing 
that Eigen’s hypothetical cycles are more likely to lose or degrade genetic information 
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over time. They point out that hypercycles, as conceived by Eigen, lack an error-free 
mechanism of self-replication. As a result, his proposed mechanism would succumb to 
various “error catastrophes” that would diminish, rather than increase, the specified 
information content of the system over time. 

In any case, hypercycles have a more obvious limitation. In formulating the idea, Eigen 
presupposed a large initial complement of information in the form of sequence-specific 
RNA molecules and some forty functional proteins. Thus, his model does not solve, even 
hypothetically, the problem of the ultimate origin of biological information. At best, it 
models how information might have developed from a preexisting source, though even 
that is doubtful for the reasons his critics have described. Eigen himself has apparently 
acknowledged that hyper-cycles presuppose, but do not explain, the ultimate origin of 
biological information. He notes that he devised hypercycles to model how information 
and structure might develop, rather than originate ? 





Figure 13.1. A schematic representation of Manfred Eigen’s concept of a hypercycle. The 
E’s represent enzymes capable of replicating RNA molecules, the R’s in the circles 
represent RNA molecules. Notice that hypercycles require an initial complement of both 
sequence-specific enzymes and information-rich RNA molecules. 



“Life” in a Computer: Genetic Algorithms 

As it became more difficult to envision how life arose by chance and necessity from the 
molecular constituents of actual living cells, evolutionary biologists sought to simulate 
the production of biological infonnation using computer programs. The first such 
attempts occurred during the late 1980s. At that time, Richard Dawkins and Bernd-Olaf 
Kiippers attempted to simulate how natural selection acting on random mutations could 
explain the origin of biological information. Each tried to do this by developing software 
programs that simulated the production of genetic information.^ 8 Both accepted the 
futility of naked appeals to chance and invoked what Kiippers calls a “Darwinian 
optimization principle.” Both used computers to demonstrate the efficacy of natural 
selection. Thus, both combined chance and necessity in a classically Darwinian way. 

In order to show how chance and selection can produce information, Dawkins and 
Kiippers first provided the computer program with a target sequence to represent a 
desired functional polymer. In their programs, the target sequences were English phrases, 
such as the line from Shakespeare’s play Hamlet, “Methinks it is like a weasel.” Next the 
computer program generated another alphabetic sequence the same length as the target. 
Then the program generated a crop of variant sequences at random. After that, the 
computer selected the sequences from among the crop that matched the target sequence 
most closely. The computer then preserved those sequences and amplified the production 
of them, eliminated the others (to simulate differential reproduction), and repeated the 
process. As Kiippers puts it, “Every mutant sequence that agrees one bit better with the 
meaningful or reference sequence... will be allowed to reproduce more rapidly.” 29 After a 
mere thirty-five generations, his computer succeeded in spelling his target sequence, 
“Natural selection.” Dawkins’s simulation took only a bit longer. In a mere forty-three 
generations, his program produced: “Methi nk s it is like a weasel.” 



Such simulations are impressive from a distance, but they conceal an obvious flaw: 
molecules in situ do not have a target sequence “in mind.’’ In biology, where differential 
survival depends upon maintaining function, selection cannot occur until functional 
sequences arise. Nor will sequences confer any selective advantage on a cell and thus 
differentially reproduce until they combine in a functionally advantageous arrangement. 
Nothing in nature (biology or chemistry) corresponds to the role that the computer plays 
in selecting functionally nonadvantageous sequences that happen to agree “one bit better” 
than others with a target sequence. The simulation doesn’t even make sense on its own 
tenns. The sequence “normal election” may agree more with “natural selection” than 
does the sequence “mistress defection,” but neither of the two yields any advantage in 
communication over the other in trying to communicate something about natural 
selection. If that is the goal, both are equally ineffectual. 

More to the point, a completely nonfunctional polypeptide would confer no selective 
advantage on a hypothetical protocell, even if its sequence happened to “agree one bit 
better” with an unrealized target protein than some other nonfunctional polypeptide. Such 
a molecule would not be preserved but eliminated, stopping the evolutionary process in 
its tracks. Yet both Kiippers’s and Dawkins’s published results of their simulations show 
the early generations of variant phrases awash in nonfunctional gibberish. In Dawkins’s 
simulation, not a single functional English word appears until after the tenth iteration 
(unlike the more generous example above that starts with the actual, albeit incorrect, 
words). Clearly it is impossible to distinguish sequences based on considerations of 
comparative functional advantage when the sequences in question have no function 
whatsoever. Such detenninations can be made only if considerations of proximity to 
possible future function are allowed, but that requires foresight, the very thing natural 
selection lacks. 

As philosopher and mathematician David Berlinski has argued, genetic algorithms need 
something akin to a “forward-looking memory” to succeed. Yet such foresighted 
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selection has no analogue in nature. In biology, where differential survival depends 
upon maintaining function, selection cannot occur before new functional sequences arise. 
Natural selection lacks foresight. A computer programmed by a human being can make 
selections based upon relative proximity to distant targets or goals, but to imply that 
molecules can do so illicitly personifies nature. If computer simulations demonstrate 
anything, they subtly demonstrate the need for an intelligent agent to elect some options 
and exclude others — that is, to create information. 



Ev 

Since the publication of Kuppers’s and Dawkins’s work, other biologists have devised 
more sophisticated evolutionary algorithms to simulate how mutation and 
selection — chance and necessity — can generate new information. One of the most famous 
examples of such an algorithm is called Ev. Ev was created by Thomas Schneider, a 
research biologist at the National Institutes of Health. Schneider cites the success of Ev 




in simulating the production of information “from scratch” to support his claim that the 
undirected material processes of “replication, mutation and selection are necessary and 
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sufficient for information gain to occur.” 

In particular, Schneider claims Ev can simulate the production of nucleotide binding sites. 
Nucleotide binding sites are very short sequences of nucleotides on the genome located 
upstream of specific genes. These binding sites enable RNA polymerase to bind to the 
correct place on a genome, thus allowing it to read and copy the genes in question. The 
binding sites typically represent between six and twenty specifically sequenced 
nucleotides, a small (but specified) amount of information in relation to an entire gene. 
Schneider claims that “the program simulates the process of evolution of new binding 
sites from scratch ,” 34 He further indicates that the Ev program has thus created 131 bits 
of information. 35 

Though the Ev program uses a more complex set of subroutines than either of the early 
genetic algorithms devised by Kiippers or Dawkins, it uses essentially the same strategy 
to ensure that the program will generate an information-rich sequence. Like Kiippers and 
Dawkins, Schneider supplies Ev with a target sequence, in this case a particular sequence 
of nucleotide bases that function as a binding site. He then has the program generate a 
random crop of sequences of equal length. After favoring sequences that manifest the 
general profile of a binding site, Ev applies a fitness function to the remaining sequences. 
The fitness function then assesses the degree of divergence between the mutated 
sequences and the target sequence and applies an error value to each mutated sequence. 
Then the Ev program preserves the sequence(s) with the least error (i.e., the degree of 
difference from the target) and permits them to replicate and mutate. Then he repeats 
the process again and again, until finally Ev converges on the target sequence. 

The target sequence involves foresight. So too does the program’s fitness function, which 
makes selections based upon proximity to future function. Thus, it again simulates a 
goal-directed foresight that natural selection does not possess. It makes use of 
information about a functional state (nucleotide binding sites) in a way that natural 
selection cannot. 

Ev incorporates one additional step that Dawkins’s and Kiippers ’s simulations lack. 
Before Ev applies its fitness function, it applies a filter to the crop of mutated sequences. 
The filter favors sequences that have the general profile of a binding site. Like the fitness 
function, this coarser filter makes use of infonnation about the functional requirements of 
binding sites to favor some sequences over others. As such, it imparts information based 
on knowledge that Thomas Schneider, not natural selection or the environment, has 
imparted into the Ev simulation. Ev exhibits the genius of its designer. 



Informational Accounting 

Recently, the senior engineering professor Robert Marks, formerly of the University of 
Washington in Seattle and now at Baylor University in Texas, analyzed evolutionary 




algorithms such as Ev. Marks shows that despite claims to the contrary by their 
sometimes overly enthusiastic creators, algorithms such as Ev do not produce large 
amounts of functionally specified information “from scratch.” Marks shows that, instead, 
such algorithms succeed in generating the information they seek either by providing 
information about the desired outcome (the target) from the outset, or by adding 
information incrementally during the computer program’s search for the target. To 
demonstrate this, Marks distinguishes and defines three distinct kinds of information: 
exogenous information, endogenous information, and active information. 

“Endogenous information” represents the infonnation present in the target. It also 
provides a measure of the difficulty of the search for that target — that is, the 
improbability of finding the specific sequence, or target, among the exponentially large 
space of alternative possibilities. Recall that the amount of information present in a 
sequence or system is inversely proportional to the probability of the sequence or system 
arising by chance. If the probability of finding the target is small the information required 
to find the target is correspondingly large. By calculating the size of the space of 
alternative possibilities in which the target resides, the computer scientist can detennine 
both the probability of finding the target in a random search and the infonnation content 
of the target in question. Marks’s analysis of evolutionary algorithms shows that, in order 
to produce or find the (endogenous) information present in the target, a programmer must 
design a search algorithm that reduces the information requirements of the search to a 
manageable level. The information added by the programmer to reduce the difficulty of 
the search he dubs “active infonnation.” The “exogenous information” is what is left after 

the active infonnation is subtracted from the endogenous information. It measures the 
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difficulty of the residual search problem. 

In his critique of Ev as well as other evolutionary algorithms, Marks shows that each of 
these putatively successful simulations of undirected mutation and selection actually 
depends upon several sources of active infonnation. The Ev program, for example, uses 
active information by applying a filter to favor sequences with the general profile of a 
nucleotide binding site. And it uses active information in each iteration of its evaluation 
algorithm or fitness function. The fitness function in Ev uses information about the target 
sequence to assess degrees of difference between a prespecified target and the mutated 
sequences produced by the program. Those sequences that have the lowest error 
values — greatest proximity to the prespecified functional sequence — are selected to 
replicate and mutate. Marks shows that each time the program uses knowledge of the 
target sequence to exclude some sequences and preserve others, it imparts a quantifiable 
amount of infonnation in its selection. Marks quantifies these sources of active 
information and shows that they reduce the difficulty well below the 131 bits Schneider 
claims that Ev can produce “from scratch.” He also shows that the endogenous 
information in even modestly difficult search problems usually cannot be generated (or 
the search problem solved) without added or “active” information to assist the search. 



Avid a 




Since the release of Ev in 2000, another evolutionary algorithm has attracted widespread 
interest. A program called Avida — A for artificial, and vida for life — was designed in 
1993 by three computer scientists: Christopher Adami and Titus Brown, of Caltech, and 
Charles Ofria, of Michigan State University. In 2003, Adami and Ofria, teamed up with 
biologist Richard Lenksi and philosopher Robert Pennock, both of Michigan State, to 
publish an article in Nature describing the results of an evolutionary simulation they had 
conducted with Avida . 40 In the article, the authors claimed that Avida demonstrates that 
“digital organisms” capable of replication could generate complex features and functions 
“by random mutations and natural selection.” 41 Avida is now widely cited to show that 
biological complexity (and presumably the information necessary to produce it) could 
have arisen from the twin forces of chance and necessity. " Unlike the earlier simulations 
of Dawkins, Kiippers, and Schneider, the Avida program does not provide the computer 
with a specific target sequence, nor does it select sequences of characters on the basis of 
proximity to possible future function. Does it therefore demonstrate the efficacy of 
chance and necessity in a way that earlier simulations did not? 

Avida consists of two main parts, the Avida world and the Avida digital organisms that 
populate this world. Avida organisms have two basic parts: software and hardware. Each 
Avida organism has a small software program consisting of a loop of instructions 
composed from a set of twenty-six predefined commands. Additionally, each digital 
organism also consists of virtual hardware that can execute each of the twenty-six 
commands among the set of possibilities. 

These commands direct Avida organisms to perform various operations on two fixed 
input strings of binary digits, thus producing various output strings. The loop of 
instructions contains a program that has been written to replicate the loop (initially this is 
all the instructions in the loop do). Upon replication, the loop of instructions is mutated as 
the Avida world makes random changes to the loop by inserting, deleting, or switching 
commands, thus making it possible for Avida organisms to generate new output strings 
from the two fixed input strings (see Fig. 13.2). 

These Avida organisms exist within a larger Avida “world.” The Avida world represents 
the rules by which the digital organisms operate and compete with each other for 
survival. The Avida world contains an evaluation algorithm — what a colleague of mine 
calls the “sniffer” — that detennines whether one of nine logical relationships exists 
between the input strings given to each digital organism and the output strings they 
produce. Another way to think of this is that the sniffer is evaluating whether a “logic 
function” has been performed on the input strings to produce the output string. For 
example, one of the logic functions the sniffer is looking for is OR. If the OR function is 
applied to a pair of binary characters, then it will generate a “1” if at least one or the other 
of the characters is a “1.” Otherwise it will generate a “0.” 43 If all the binary digits in an 
output string are consistent with any of the nine logic functions having been applied to 
each pair of binary digits along the two input strings, then the sniffer will recognize that a 
logic function has been performed. (See Fig. 13.2.) 
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Figure 13.2. Visual representation of the Avida organism within Avida world. The loop of 
instructions (left) operates on the input sequences (top) while stored in the registers 
(center) to produce an output sequence (bottom). The evaluation algorithm (depicted in 
personified form but only for effect) compares the input sequence to the output sequence 
to see if one of nine specific logical relationships exists between the sequences. If so, the 
Avida world rewards the Avida organism with resources that improve the organism’s 
ability to replicate itself. 



In that case, the Avida organism that performed the logic function is rewarded with 
resources that improve its ability to replicate itself. Replication is imperfect, meaning that 
the “offspring” of an Avida organism may carry random changes to the instructions in 
their program loops. These changes involve replacing a command with another (from the 
set of twenty-six), inserting a command (making the loop longer), or deleting a command 
(making the loop shorter). 

The authors of the Nature paper claim to have used Avida to simulate how natural 
selection and mutation can produce more complex functions by rearranging genomic 
instructions. Their simulation showed that after many generations Avida produced digital 
organisms capable of performing both simple and compound logic functions. (A 
compound logic function consists of a series of discrete logic functions that have been 
performed in sequence.) 44 As the authors put it, starting from a digital organism that 
could only replicate itself, Avida evolved “digital organisms capable of performing 
multiple logic functions requiring the coordinated execution of many genomic 
instructions.” 45 Since, in actual organisms, biological functions invariably result from 
specified sequences of bases — that is, from specified information — and since the Avida 
authors claim to have simulated the production of complex functions, they also 



effectively claim to have simulated the production of new biological information. So, 
does this state-of-the-art algorithm solve or at least simulate a solution to the DNA 
enigma? Well, not exactly. 



Avida Presupposes Information 

First, even as a demonstration of how biological evolution might generate new biological 
information starting from a preexisting organism, Avida leaves much to be desired. As I 
discuss in more detail in an accompanying note , 46 Avida lacks realism as a simulation of 
biological evolution because the program selects functionally significant logic functions 
possessing too little complexity to represent the actual infonnation content of functional 
proteins or genes. By allowing it to choose logic functions that are far simpler than any 
functional gene or protein, the program diminishes the probabilistic task that nature 
would face in “trying” to evolve the first self-reproducing cell. Thus, the program does 
not simulate how a random search through a relevantly sized combinatorial space could 
generate new information — information and structure that must arise first by random 
mutation before natural selection can act to preserve it. Yet any complete theory of 
biological evolution must explain precisely the origin of the specified infonnation present 
in new genes and proteins. 

In any case, Avida does not simulate how the information necessary to produce the first 
organism might have originated. Much like Eigen’s hypercycles, Avida begins with a 
sizable complement of preexisting information, including virtual “organisms” that have 
been programmed with the capacity to self-replicate. In an Avida organism much of this 
information is stored in the loop of discrete instructions drawn from the set of twenty-six. 
These instructions express discrete commands such as, “Calculate the difference between 
the values in two registers.” These instructions direct the Avida organism to perform 
specific tasks on the strings of binary digits (the inputs) described above. These 
commands, whether expressed in the C++ programming language, a machine code, or 
translated into English, represent a significant amount of functionally specified 
information. 

In addition, the instructions on the initial loop in the digital organisms were specifically 
arranged and programmed to replicate themselves. The sequencing of these discrete 
instructions on the loop constitutes another significant source of preexisting information 
beyond the information in the instructions themselves. Still further, an intricately 
designed computing device interprets the instruction sets. This computer hardware 
constitutes a highly improbable and precisely configured arrangement of parts. These 
devices represent a sizable amount of structural information that would have required a 
large (and, in principle, calculable) infusion of digital infonnation to manufacture. 
Therefore, if the Avida organisms tell us anything about the origin of life, they tell us 
something about how much information must be supplied before a self-replicating 
organism capable of selection could arise. 




In fairness, Avida’ s advocates do not explicitly claim that their program simulates the 
origin of the information necessary to build a cell in the first place. Instead, they claim 
that Avida shows how new complex functions and features (and the instructions 
necessary to produce them) might have arisen from preexisting organisms. Even so, much 
of the hype surrounding Avida fails to distinguish these two cases. In a recent conference 
presentation at Oxford, Robert Pennock, one of the four authors of the Avida article in 
Nature, claimed — rather categorically — that Avida simulates how natural selection and 
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random mutation produce “design without a designer.” 

Even if one were to ignore the intelligent design of the Avida program itself and grant 
that Avida simulates how some appearances of design might have arisen without a 
designer starting from a preexisting self-replicating organism, it would not follow that 
Avida has simulated the origin of the most salient appearance of design: the original 
“machine code of the genes.” Instead, Avida presupposes, but does not explain, the origin 
of the infonnation necessary to produce the first self-replicating organism. Avida “solves” 
the problem of the origin of biological information by presupposing its solution at the 
outset. It dissolves the pink stuff only by using more of it as a solvent. 



The Conservation of Information 

The failure of Ev and Avida to simulate a naturalistic solution to the DNA enigma 
illustrates a more general problem with evolutionary algorithms and, indeed, a more 
general principle about infonnation itself. Invariably, evolutionary algorithms succeed in 
producing specified information (or its functional equivalent) as the result of preexisting 
information, or the programming skill of computer scientists, or both. With the exception 
of a small and quantifiable amount of information that may arise as the result of random 
effects, the infonnation produced in these algorithms does not exceed the information 
that was provided to them by the programmers who designed them (and the engineers 
who designed the hardware). In some programs, such as Richard Dawkins’s rather simple 
simulation, the role of the programmer in providing the information necessary to the 
outcome is obvious. In others programs, such as Avida, programmers play a more subtle 
role, though not one that escapes a careful accounting of the source of critical 
informational inputs. Either way, infonnation in a computational context does not 
magically arise without the assistance of the computer scientist. 

My office is in Redmond, Washington, and I have friends who are computer 
programmers for some of the software companies in the area. One of my friends is a 
retired forty-something programmer, who was fonnerly one of Microsoft’s elite 
architect-level programmers. He also has a special interest in the origin of life and 
evolutionary algorithms. He said something interesting to me about these programs: 
“There is absolutely nothing surprising about the results of these algorithms. The 
computer is programmed from the outset to converge on the solution. The programmer 
designed the code to do that. What would be surprising is if the program didn’t converge 
on the solution. That would reflect badly on the skill of the programmer. Everything 




interesting in the output of the program came as a result of the programmer’s skill — the 
information input. There are no mysterious outputs.” 



Computer science has two principles that codify this insight. Indeed, these principles can 
be or, in some cases, were derived from a careful analysis of evolutionary algorithms 
themselves. The first principle is called the “no free lunch” (NFL) theorem. The theorem 
was first developed by David Wolpert and William Macready, two computer scientists at 
NASA’s Ames Research Center . 48 It describes a constraint that evolutionary algorithms 
invariably face as they attempt to find information-rich targets — such as a meaningful 
sequence of letters or a functional arrangement of nucleotide bases — in large 
combinatorial spaces. The NFL theorem states that an evolutionary algorithm will, on 
average, perform no better than a blind search in finding a target within a large space of 
possibilities unless external sources of infonnation are provided to the algorithm to point 
it toward the target. In other words, finding an information-rich target usually requires an 
enonnous amount of “active infonnation” to reduce the difficulty of a random search to a 
manageable level (i.e., to a level commensurate with the probabilistic resources available 
to the evolutionary algorithm). 

The game Twenty Questions illustrates this principle. In the game, one person has 
information about a person, place, or thing that another person must guess. If the person 
guessing is clever, he or she can acquire that information by asking a series of yes or no 
questions that gradually narrows the field of possibilities. But each time the person 
guessing receives a yes or no to a question, he or she is also receiving infonnation. The 
NFL theorem simply states that such infonnation is usually — barring a rare and very 
lucky search — indispensable to the success of the search. Richard Dawkins’s simulation 
provided an obvious illustration of this principle. His evolutionary algorithm succeeded, 
but only because he provided it with an external source of infonnation — in particular, a 
target sequence and fitness function — which the computer uses to guide its selection of 
sequences in each successive iteration of its search. 

The NFL theorem affirms what a careful analysis of individual evolutionary simulations 
reveals: the infonnation produced by an evolutionary simulation does not arise for “free,” 
that is, without an input from the programmer. Large informational outputs require 
(roughly) equivalent infonnational inputs. 

The second principle relevant to the assessment of evolutionary algorithms is closely 
related to the first. It is called the law of the “conservation of information” (COI). Leon 
Brillouin, a French-American physicist and innovator in computer science and 
information theory, states the law in its most basic form: “The computing machine does 
not create any new information .” 49 Robert Marks has a more colorful way of expressing 
the same idea. “Computers,” he says, “are no more able to create information than iPods 
are capable of creating music .” 50 Computer scientists have formulated various 
mathematically precise laws of conservation of information to express this basic 
principle. Most state that within certain quantifiable limits the amount of information in a 
computer in its initial state (considering both its hardware and software) equals or 




exceeds the amount of information in its final state. A careful analysis of evolutionary 
algorithms confirms this principle. Genetic algorithms can “solve” the information 
problem, but only if programmers first supply information about proximity to target 
sequences, selection criteria, or loops of precisely sequenced instructions. 

But I noticed something else. The idea of the conservation of infonnation also seems to 
apply beyond the computer domain. Indeed, most of us know from our ordinary 
experience that information typically degrades over time unless intelligent agents 
generate (or regenerate) it. The sands of time have erased some inscriptions on Egyptian 
monuments. The leak in the attic roof smudged the ink in the stack of old newspapers, 
making some illegible. In the game of Telephone, the message received by the last child 
in line bears little resemblance to the one the first child whispered. Common experience 
confirms this general trend — and so do prebiotic simulation experiments and 
origin-of-life research. Simulation experiments produce biologically relevant molecules, 
but only if experimentalists manipulate initial conditions and guide outcomes — that is, 
only if they first provide specified information themselves. Those origin-of-life theories 
“succeed” in accounting for the origin of information only by presupposing an ample 
supply of information in some preexisting form. 

In light of this, I fonnulated a conservation law of my own as a working hypothesis to 
distill my experience and what I had discovered about origin-of-life research. Since I was 
not principally concerned with whether biological evolution could generate specified 
information, I decided to fonnulate a “conservative” conservation law — one that applied 
only to a nonbiological context (and thus not to an infonnation-rich initial state). My 
statement of the law does not say anything about whether undirected natural processes 
could produce an increase in specified information starting from preexisting forms of life. 
But it does encapsulate what repeated experience had demonstrated about the flow of 
information starting from chemistry and physics alone. 

Here’s my version of the law of conservation of infonnation: “In a nonbiological context, 
the amount of specified information initially present in a system, Si, will generally equal 
or exceed the specified infonnation content of the final system, Sf.” This rule admits only 
two exceptions. First, the information content of the final state may exceed that of the 
initial state, Si, if intelligent agents have elected to actualize certain potential states while 
excluding others, thus increasing the specified information content of the system. Second, 
the information content of the final system may exceed that of the initial system if 
random processes have, by chance, increased the specified information content of the 
system. In this latter case, the potential increase in the information content of the system 
is limited by the “probabilistic resources” available to the system. 

1 

As noted in Chapter 10, the probabilistic resources of the entire universe equal 10 
trials, which, in turn, corresponds to an informational measure of less than 500 bits. This 
represents the maximum information increase that could be reasonably expected to occur 
by chance from the big-bang singularity to the present — without assistance from an 
intelligent agent. Systems that exist over a shorter period of time will have 




correspondingly smaller envelopes of maximal information increase and will, in any case, 
usually experience informational loss or degradation without input from an agent. Taking 
these caveats into account allows a more general statement of the law as follows: “In a 
nonbiological context and absent intelligent input, the amount of specified infonnation of 
a final system, Sf, will not exceed the specified information content of the initial system, 
Si, by more than the number of bits of information the system’s probabilistic resources 
can generate, with 500 bits representing an upper bound for the entire observable 
universe.” 51 



Conclusion 

The law of the conservation of infonnation as variously articulated provides another way 
of describing the displacement problem that I had seen repeatedly demonstrated by 
attempts to solve the DNA enigma. The failure of genetic algorithms to simulate the 
production of specified information within the obviously artificial domain of a computer 
only provided another illustration of this apparently ubiquitous problem. These attempts 
to simulate how purely undirected processes might have produced infonnation only 
pushed the information problem back to a decidedly directing entity — the human mind. 
Thus, in an oddly unexpected way, evolutionary algorithms and the conservation 
principle derived in part from analyzing them pointed to an external source of specified 
information. For this reason, the law of conservation of information seemed to have 
profound implications for the origin-of-life debate. 

But is the conservation law true? It was certainly consistent with everything I knew from 
my ordinary prescientific experience. And I had discovered nothing in my study of 
origin-of-life research to contradict the law. Quite the opposite was the case. Model after 
model failed precisely to explain the origin of biological infonnation, the DNA enigma. 
Those theories that appeared to solve the problem of the origin of infonnation — the 
metaphorical “pink stuff’ — did so only by displacing or transferring it elsewhere. 

Yet I also knew that there was another major hypothesis about the origin of life that was 
attracting attention. It too combined chance and necessity, but envisioned a role for 
natural selection much earlier in the process of abiogenesis. This hypothesis also held out 
the possibility of explaining the classical “chicken and egg” problem — the origin of the 
interdependence of DNA and proteins — by starting the process of abiogenesis in a 
different place — or rather, with a different molecule. 

The “RNA world” hypothesis as it is called, is the big one — the idea that has currently 
captivated the wider scientific community and a host of working molecular biologists “at 
the bench.” Could this model solve the problem of the origin of biological information 
without displacing it elsewhere? Does it refute the law of infonnation conservation — or at 
least provide a documented exception to it? Before I could establish that biological 
information pointed decisively to an intelligent cause, I knew I would need to examine 
this theory in more detail. The idea was too big and too popular to ignore. 
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The RNA World 



By the mid-1980s many researchers concluded that both DNA-first and protein- first 
origin-of-life models were beset with many difficulties. As a result, they sought a third 
way to explain the mystery of life’s origin. Instead of proposing that the first 
informational molecules were proteins or DNA, these scientists argued that the earliest 
stages of abiogenesis unfolded in a chemical environment dominated by RNA molecules. 
The first scientist to propose this idea was Carl Woese, a microbiologist at the University 
of Illinois. Walter Gilbert, a Harvard biophysicist, later developed the proposal and 
coined the term by which it is now popularly known, the “RNA world.” 1 

The RNA world is now probably the most popular theory of how life began. Scientists in 
some of the most prestigious labs around the world have perfonned experiments on RNA 
molecules in an attempt to demonstrate its plausibility, and in the opinion of many 
scientists, the RNA-world hypothesis establishes a promising framework for explaining 
how life on earth might have originated. 

I had an encounter with one such scientist in the spring of 2000. I had just written an 

article about DNA and the origin of life in the April issue of a prominent New York 

journal of opinion. When the letters to the editor came in, I initially blanched when I 

saw one from a fierce critic named Kenneth R. Miller, a biology professor at Brown 

University and a skilled debater. Had I made a mistake in reporting some biological detail 

in my argument? When I saw his objection, however, I was relieved. Miller claimed that 

my critique of attempts to explain the origin of biological information had failed to 

address the “RNA first” hypothesis. Miller asserted that I had ignored “nearly two 

decades of research on this very subject” and failed to tell my “readers of experiments 

showing that very simple RNA sequences can serve as biological catalysts and even 
/ 1 

self-replicate.” 




Figure 14.1. Walter Gilbert, photographed in front of a chalkboard in his office at 
Harvard. Courtesy of Peter Menzel/Science Photo Library 



Miller was half right. I hadn’t told my readers about these experiments. But I knew that 
two decades of research on this topic had not solved the problem of the origin of 
biological information. Because of space constraints and the format of the journal, I had 
decided not to address this issue in my original article. But now Miller’s letter gave me a 
chance to do so. 

At the time I had been studying research articles from origin-of-life specialists who were 
highly critical of the RNA-world hypothesis, and in my response to Miller I cited and 
summarized many of their arguments. I heard nothing more from Miller on the matter, 
but as I attended various conferences over the next several years, I discovered that he was 
far from alone. Despite the pervasive skepticism about the RNA world among leading 
origin-of-life researchers, many practicing molecular biologists, including some very 
prominent scientists at famous labs, continued to share Miller’s enthusiasm. Moreover, I 
discovered that many of these molecular biologists had recently initiated new 
experimental work inspired by their confidence in the viability of the RNA-world 
approach. Had they solved the information problem? 



Second Things First 




The RNA world is a world in which the chicken and egg no longer confound each other. 
At least that has been the hope. Building proteins requires genetic information in DNA, 
but information in DNA cannot be processed without many specific proteins and protein 
complexes. This problem has dogged origin-of-life research for decades. The discovery 
that certain molecules of RNA possess some of the catalytic properties seen in proteins 
suggested a way to solve the problem. RNA-first advocates proposed an early stage in the 
development of life in which RNA performed both the enzymatic functions of modem 
proteins and the information-storage function of modern DNA, thus sidestepping the need 
for an interdependent system of DNA and proteins in the earliest living system. 

Typically RNA-first models have combined chance events and a law-like process of 
necessity, in particular, the process of natural selection. As Gilbert and others envision it, 
a molecule of RNA capable of copying itself (or copying a copy of itself) first arose by 
the chance association of nucleotide bases, sugars, and phosphates in a prebiotic soup 
(see Fig. 14.2). Then because that RNA enzyme could self-replicate, natural selection 
ensued, making possible a gradual increase in the complexity of the primitive 
self-replicating RNA system, eventually resulting in a cell with the features we observe 
today. Along the way, a simple membrane, itself capable of self-reproduction, enclosed 
the initial RNA enzymes along with some amino acids from the prebiotic soup. 4 

According to this model, these RNA enzymes eventually were replaced by the more 
efficient proteins that perform enzymatic functions in modern cells. For that to occur, the 
RNA-replicating system first had to begin producing a set of RNA enzymes that could 
synthesize proteins. As Gilbert has explained, in this step RNA molecules began “to 
synthesize proteins, first by developing RNA adapter molecules that can bind activated 
amino acids and then by arranging them according to an RNA template using other RNA 
molecules such as the RNA core of the ribosome.” 5 Finally, DNA emerged for the first 
time by a process called reverse transcription. In this process, DNA received the 
information stored in the original RNA molecules, and eventually these more stable DNA 
molecules took over the information-storage role that RNA had performed in the RNA 
world. At that point, RNA was, as Gilbert put it, “relegated to the intermediate role it has 
today — no longer the center of the stage, displaced by DNA and the more effective 
protein enzymes.” 6 





Figure 14.2. The RNA World Scenario in Seven Steps. Step 1: The building blocks of 
RNA arise on the early earth. Step 2: RNA building blocks link up to form RNA 
oligonucleotide chains. Step 3: An RNA replicase arises by chance and selective 
pressures ensue favoring more complex forms of molecular organization. Step 4: RNA 
enzymes begin to synthesize proteins from RNA templates. Step 5: Protein-based protein 
synthesis replaces RNA-based protein synthesis. Step 6: Reverse transcriptase transfers 
genetic information from RNA molecules into DNA molecules. Step 7: The modern gene 
expression system arises within a proto-membrane. 



I knew that origin-of-life theories that sound plausible when stated in a few sentences 
often conceal a host of practical problems. And so it was with the RNA world. As I 
investigated this hypothesis, both before and after my exchange with Professor Miller, I 
found that many crucial problems lurked in the shadows, including the one I had seen 
before: the theory did not solve the problem of biological infonnation — it merely 
displaced it. 

Because so many scientists assume that the RNA world has solved the problem of the 
origin of life, this chapter will provide a detailed and, in some places, technical critique of 
this hypothesis. My critique details five crucial problems with the RNA world, 
culminating in a discussion of the information problem. To assist nontechnical readers, I 
have placed some of this critique in notes for the scientifically trained. I would ask 
technically minded readers to read these notes in full, because in some cases they provide 
important additional support for, or qualifications to, my arguments. 

Each element of this critique stands mostly on its own. So if you find that the technical 
material under one subheading presupposes unfamiliar scientific concepts or tenninology, 
take note of the heading, which summarizes the take-home message of the section, and 
skip ahead to the next one, or even the final two, which address the theory’s greatest 
weakness: its inability to explain the origin of biological information. 



Problem 1: RNA Building Blocks Are Hard to Synthesize and Easy to Destroy 

Before the first RNA molecule could have come together, smaller constituent molecules 
needed to arise on the primitive earth. These include a sugar known as ribose, phosphate 
molecules, and the four RNA nucleotide bases (adenine, cytosine, guanine, and uracil). It 
turns out, however, that both synthesizing and maintaining these essential RNA building 
blocks, particularly ribose (the sugar incorporated into nucleotides) and the nucleotide 
bases, has proven either extremely difficult or impossible to do under realistic prebiotic 
conditions. 7 (See Fig. 14.3.) 

Consider first the problems with synthesizing the nucleotide bases. In the years since the 
RNA world was proposed, chemist Robert Shapiro has made a careful study of the 
chemical properties of the four nucleotide bases to assess whether they could have arisen 
on the early earth under realistic conditions. He notes first that “no nucleotides of any 
kind have been reported as products of spark-discharge experiments or in studies of 
meteorites.” Stanley Miller, who performed the original prebiotic simulation experiment, 
published a similar study in 1998. 8 Moreover, even if they did somehow form on the 
early earth, nucleotide bases are too chemically fragile to have allowed life enough time 
to evolve in the manner Gilbert and other RNA-first theorists envision. Shapiro and 
Miller have noted that the bases of RNA are unstable at temperatures required by 
currently popular high-temperature origin-of-life scenarios. The bases are subject to a 
chemical process known as “deamination,” in which they lose their essential amine 
groups (NH 2 ). At 100 degrees C, adenine and guanine have chemical half-lives of only 
about one year; uracil has a half-life of twelve years; and cytosine a half-life of just 




nineteen days. Because these half-lives are so short, and because the evolutionary process 
envisioned by Gilbert would take so long — especially for natural selection to find 
functional ribozymes (RNA molecules with catalytic activity) by trial and error — Stanley 
Miller concluded in 1998 that “a high temperature origin of life involving these 
compounds [the RNA bases] therefore is unlikely.” 9 Miller further noted that, of the four 
required bases, cytosine has a short half-life even at low temperatures, thus raising the 
possibility that “the GC pair” (and thus RNA) “may not have been used in the first 
genetic material.” Shapiro concurred. He showed that it would have been especially 
difficult to synthesize adenine and cytosine at high temperatures and cytosine even at low 
temperatures. Thus he concluded that the presumption that “the bases, adenine, cytosine, 
guanine and uracil were readily available on the early earth” is “not supported by existing 
knowledge of the basic chemistry of these substances.” 10 
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Figure 14.3. The chemical structure and constituents of RNA. 



Producing ribose under realistic conditions has proven even more problematic. Prebiotic 
chemists have proposed that ribose could have arisen on the early earth as the by-product 



of a chemical reaction called the formose reaction. The formose reaction is a multistep 
chemical reaction that begins as molecules of formaldehyde in water react with one 
another. Along the way, the formose reaction produces a host of different sugars, 
including ribose, as intermediate by-products in the sequence of reactions. But, as 
Shapiro has pointed out, the formose reaction will not produce sugars in the presence of 
nitrogenous substances. 11 These include peptides, amino acids, and amines, a category of 
molecules that includes the nucleotide bases. 

This obviously poses a couple of difficulties. First, it creates a dilemma for scenarios that 
envision proteins and nucleic acids arising out of a prebiotic soup rich in amino acids. 
Either the prebiotic environment contained amino acids, which would have prevented 
sugars (and thus DNA and RNA) from forming, or the prebiotic soup contained no amino 
acids, making protein synthesis impossible. Of course, RNA-lirst advocates might try to 
circumvent this difficulty by proposing that proteins arose well after RNA. Yet since the 
RNA-world hypothesis envisions RNA molecules coming into contact with amino acids 
early on within the first protocellular membranes (see above), choreographing the origin 
of RNA and amino acids to ensure that the two events occur separately becomes a 
considerable problem. 

The RNA-world hypothesis faces an even more acute, but related, obstacle — a kind of 
catch-22. The presence of the nitrogen-rich chemicals necessary for the production of 
nucleotide bases prevents the production of ribose sugars. Yet both ribose and the 
nucleotide bases are needed to build RNA. (See note for details). As Dean Kenyon 
explains, “The chemical conditions proposed for the prebiotic synthesis of purines and 
pyrimidines [the bases] are sharply incompatible with those proposed for the synthesis of 
ribose.” Or as Shapiro concludes: “The evidence that is currently available does not 
support the availability of ribose on the prebiotic earth, except perhaps for brief periods 
of time, in low concentration as part of a complex mixture, and under conditions 
unsuitable for nucleoside synthesis.” 4 

Beyond that, both the constituent building blocks of RNA and whole RNA molecules 
would have reacted readily with the other chemicals present in the prebiotic ocean or 
environment. These “interfering cross-reactions” would have inhibited the assembly of 
RNA from its constituent monomers and inhibited any movement from RNA molecules 
toward more complex biochemistry, since the products of these reactions typically 
produce biologically inert (or irrelevant) substances. 

Furthermore, in many cases, reactions (such as the fonnose reaction) that produce 
desirable by-products such as ribose also produce many undesirable chemical 
by-products. Unless chemists actively intervene, undesirable and desirable chemical 
by-products of the same reaction react with each other to alter the composition of the 
desired chemicals in ways that would inhibit the origin of life. In sum, synthesizing the 
building blocks of the RNA molecule under realistic prebiotic conditions has proven 
formidably difficult. 




Problem 2: Ribozymes Are Poor Substitutes for Proteins 



Another major problem with the RNA world is that naturally occurring RNA molecules 
possess very few of the specific enzymatic properties of proteins. To date, scientists have 
shown that ribozymes can perform a small handful of the thousands of functions 
performed by modem proteins. Scientists have shown that some RNA molecules can 
cleave other RNA molecules (at the phosphodiester bond) in a process known as 
hydrolysis. Biochemists also have found RNAs that can link (ligate) separate strands of 
RNA (by catalyzing the formation of phosphodiester bonds). Other studies have shown 
that the RNA in ribosomes (rRNA) promotes peptide-bond fonnation within the 
ribosome 15 and can promote peptide bonding outside the ribosome, though only in 
association with an additional chemical catalyst. 16 Beyond that, RNA can perform only a 
few minor functional roles and then usually as the result of scientists intentionally 
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“engineering” or “directing” the RNA catalyst (or ribozyme) in question. 

For this reason, claiming that catalytic RNA could replace proteins in the earliest stages 
of chemical evolution is extremely problematic. To say otherwise would be like asserting 
that a carpenter wouldn’t need any tools besides a hammer to build a house, because the 
hammer performed two or three carpentry functions. True, a hammer does perform some 
carpentry functions, but building a house requires many specialized tools that can 
perform a great variety of specific carpentry functions. In the same way, RNA molecules 
can perform a few of the thousands of different functions proteins perfonn in “simple” 
single cells (e.g., in the E. coli bacterium), but that does not mean that RNA molecules 
can perfonn all necessary cellular functions. 



Problem 3: An RNA-based Translation and Coding System Is Implausible 

The inability of RNA molecules to perform many of the functions of protein enzymes 
raises a third and related concern about the plausibility of the RNA world. RNA-world 
advocates offer no plausible explanation for how primitive self-replicating RNA 
molecules might have evolved into modem cells that rely on a variety of proteins to 
process genetic information and regulate metabolism. 18 

To evolve beyond the RNA world, an RNA-based replication system eventually would 
have to begin to produce proteins, and not just any proteins, but proteins capable of 
template-directed protein manufacture. But for that to occur, the RNA replicator first 
would need to produce machinery for building proteins. In modem cells it takes many 
proteins to build proteins. So, as a first step toward building proteins, the primitive 
replicator would need to produce RNA molecules capable of performing the functions of 
the modem proteins involved in translation. (Recall from Chapter 5 that translation is the 
process of building proteins from the instmctions encoded on an mRNA transcript.) 
Presumably, these RNA molecules would need to perform the functions of the twenty 
specific tRNA synthetases and the fifty ribosomal proteins, among the many others 
involved in translation. At the same time, the RNA replicator would need to produce 
tRNAs and the many mRNAs carrying the information for building the first proteins. 




These mRNAs would need to be able to direct protein synthesis using, at first, the 
transitional ribozyme-based protein-synthesis machinery and then, later, the pennanent 
and predominantly protein-based protein-synthesis machinery. In short, the evolving 
RNA world would need to develop a coding and translation system based entirely on 
RNA and also generate the information necessary to build the proteins that later would be 
needed to replace it. 

This is a tall order. The cell builds proteins from the infonnation stored on the mRNA 
transcript (i.e., the copy) of the original DNA molecule. To do this, a bacterial cell 
depends upon a translation and coding system consisting of 106 distinct but functionally 
integrated proteins as well several distinct types of RNA molecules (tRNAs, mRNAs, and 
rRNAs). 19 This system includes the ribosome (consisting of fifty distinct protein parts), 
the twenty distinct tRNA synthetases, twenty distinct tRNA molecules with their specific 
anticodons (all of which jointly embody the genetic code), various other proteins, 
free-floating amino acids, ATP molecules (for energy), and — last, but not 
least — information-rich mRNA transcripts for directing protein synthesis. Furthermore, 
many of the proteins in the translation system perform multiple functions and catalyze 
coordinated multistep chemical transfonnations (see Fig. 14.4). 

Is it possible that a similar translation and coding system capable of producing 
genetically encoded proteins might first have arisen using only RNA catalysts 
(ribozymes)? Advocates of the RNA-world hypothesis have defended the possibility 
because of the demonstrated catalytic properties of some RNA molecules. Eugene 
Koonin and Yuri Wolf, two prominent scientists at the National Center for 
Bio-technology Information, recently reviewed the results of research on the capacities of 
RNA catalysts in an important article assessing the plausibility of an RNA-based 
translation system. They note that in the last twenty years, molecular biologists have 
documented, or engineered, ribozymes that can catalyze “all three elementary 
reactions” required for translation, including aminoacylation (the fonnation of a bond 
between an amino acid and an RNA), the peptidyl-transferase reaction (which fonns the 
peptide bond between amino acids), and amino-acid activation (in which adenosine 
monophosphate is attached to an amino acid). 

At first glance, these results may seem to support the feasibility of an RNA-based 
translation system. Nevertheless, significant reasons to doubt this aspect of the 
RNA-world hypothesis remain, as Koonin and Yuri note. First, though ribozymes have 
demonstrated the capacity to catalyze representative examples of the three main types of 
chemical reactions involved in translation, they have not demonstrated the ability to 
catalyze anywhere near all the necessary reactions that fall within these general 
classifications. Moreover, the gap between “some” and “all” necessary reactions of a 
given type remains significant. For example, ribozyme engineers have successfully 
designed an RNA molecule that will catalyze the formation of an aminoacyl bond 
between itself and the amino acids leucine and phenylalanine. But no one has yet 
demonstrated that RNA can catalyze aminoacyl bonds with the other eighteen 
protein-forming amino acids, still less with the specificity required to make the resulting 




molecules useful for translation. Yet establishing a genetic code requires molecules that 
can catalyze highly specific aminoacylation for each of the twenty protein-forming amino 
acids. To say that RNA can catalyze “aminoacylation” is true, but it obscures the 
distinction between part of a group and the whole group, where having the whole group 
of molecules is necessary to the function in question. Again, it takes more than a hammer 
to build a house. 



orwei 

Pfecsrtu^ 

A. 

i CfcMEuArEoH 



?JT5 

(j. J nji.ir..rT< 



Ah±wo 






AMJWt? 
fi-C/L - 




Figure 14.4. The main molecular components of the translation system: twenty specific 
transfer-RNA molecules, twenty specific aminoacyl tRNA synthetases, the ribosome with 
its two main subunits composed of fifty proteins and ribosomal RNA, the 
messenger-RNA transcript and a supply of amino acids. 



Second, unlike RNA catalysts (ribozymes), the protein-based enzymes involved in 
translation perfonn multiple functions, often in closely integrated or choreographed ways. 
Ribozymes, however, are the one-trick ponies of the molecular world. Typically, they can 



perform one subfunction of the several coordinated functions that a corresponding 
enzyme can perform. But they cannot perform the entire range of necessary functions, 
nor can they do so with the specificity needed to execute the many sequentially 
coordinated reactions that occur during translation. 

Consider what ribozymes must do to rival the capacities of the synthetases that catalyze 
aminoacylation, which occurs between tRNA molecules and their “conjugate” amino 
acids during translation in actual cells. Researchers have demonstrated that certain RNA 
molecules can bind a protein-forming amino acid, phenylalanine, to itself, thus 
performing the function of aminoacylation. They have even isolated a version of the 
RNA catalyst that binds only phenylalanine, achieving a specificity of sorts. But the 
synthetase enzymes responsible for aminoacylation in life must catalyze a complex 
two-stage chemical reaction involving three kinds of molecules: amino acids, ATP 
(adenosine triphosphate), and tRNAs. 

In the first stage of this reaction, synthetases couple ATP to a specific amino acid, giving 
it the stored energy (in the form of adenosine monophosphate, AMP) needed to establish 
a bond with a tRNA molecule. Next, synthetases couple specific tRNA molecules to 
specific activated (AMP-charged) amino acids. These tRNAs have specific shapes and 
anticodon sites that enable them to bond to mRNA at the ribosome. Thus, synthetases 
help form molecular complexes with a specificity of fit and with specific binding sites 
that enable translation to occur in the context of a whole system of associated molecules. 

The RNA catalyst proposed as a precursor to the synthetase cannot do this. It does not 
couple ATP to amino acids as a precursor to catalyzing aminoacylation. Instead, the 
ribozyme engineer provides “preadenylated” amino acids (amino acids already li nk ed to 
AMP molecules). Nor does the RNA catalyst couple an amino acid to a specific tRNA 
with a specific anticodon. The more limited specificity it achieves only ensures that the 
RNA catalyst will bind a particular amino acid to itself, a molecule that does not possess 
the specific cloverleaf shape or structure of a tRNA. Moreover, this RNA does not carry 
an anticodon binding site corresponding to a specific codon on a separate mRNA 
transcript. Thus, it has no functional significance within a system of molecules for 
performing translation. Indeed, no other system of molecules has even been proposed 
that could confer functional significance or specificity on the amino acid-RNA 
complexes catalyzed by the aminoacyl ribozyme. 

Thus, even in the one case where ribozyme engineers have produced an RNA-aminoacyl 
catalyst, the ribozyme in question will not produce a molecule with a functional 
specificity, or capacity to perform coordinated reactions, equivalent to that of the 
synthetases used in modem cells. Yet without this specificity and capacity to coordinate 
reactions, translation — the construction of a sequence-specific arrangement of amino 
acids from the specific RNA transcript — will not occur. 

Similar limitations affect the RNA catalysts that have been shown to be capable of 
peptidyl-transferase activity (i.e., catalyzing peptide bonds between amino acids). These 




ribozymes (made of free-standing ribosomal RNA) compare quite unfavorably with the 
capacities of the protein-dominated ribosomes that perfonn this function in extant cells. 
For example, researchers have found that free-standing ribosomal RNA can only catalyze 
peptide-bond fonnation in the presence of another catalyst. More important, apart from 
the proteins of the ribosome, free-standing ribosomal RNA does not force amino acids to 

link together into linear chains, which is essential to protein function. (For more details, 
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see the note.) 

Why RNA Catalysts Can ’t Do What True Enzymes Can 

There is a fundamental chemical reason for the limited functionality of RNA 
catalysts — one that casts still further doubt on the RNA-world hypothesis and specifically 
on its account of the origin of the translation system. Because of the inherent limitations 
of RNA chemistry, single RNA molecules do not catalyze the coordinated multistep 
reactions that enzymes, such as synthetases, catalyze. Even if separate RNA catalysts can 
be found that catalyze each of the specific reactions involved in translation (which is by 
no means certain), that would leave us very far short of a translation system. Each pony 
of the RNA world does only its one trick. And even if all the ponies were present 
together, each one would do only its particular trick separately, decoupled from the 
others. That’s a problem, because producing the molecular complexes necessary for 
translation requires coupling multiple tricks — multiple crucial reactions — in a closely 
integrated (and virtually simultaneous) way. True enzyme catalysts do this. RNA and 
small-molecule catalysts do not. 

Here’s the chemical backstory. Enzymes couple energetically favorable and unfavorable 
reactions together into a series of reactions that are energetically favorable overall. As a 
result, they can drive forward two reactions where ordinarily only one would occur with 
any appreciable frequency. Water runs downhill because of favorable energetics provided 
by gravitational force. Water does not run uphill, however, unless there is so much of it 
that it accumulates and slowly rises up the bank. Whether chemical reactions will occur 
readily depends upon whether there is enough energy to make them occur. Molecules 
with enough stored energy to establish new chemical bonds will react readily with one 
another. Molecules with insufficient stored energy will not react readily with each other 
unless vast amounts of the reactants are provided (the equivalent of the rising water 
flooding the banks). 

Enzymes use a reaction that liberates energy to drive forward a reaction that requires 
energy, coupling energetically favorable and unfavorable reactions together. Enzymes 
can do this because they have a complex three-dimensional geometry that enables them 
to hold all the molecules involved in each step of the reaction together and to coordinate 
their interactions. But two independent catalysts cannot accomplish what a compound 
catalyst (i.e., an enzyme) can. And so far RNA ribozymes have demonstrated the capacity 
to act only as independent catalysts, not true enzyme catalysts. RNA catalysts might 
catalyze some energetically favorable reactions, but without the sophisticated active sites 




of enzymes, they can’t couple those favorable reactions to energetically unfavorable 
reactions. 26 (See Fig. 14.5.) 
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Figure 14.5. Enzymes couple energetically favorable and unfavorable reactions together 
into a series of reactions that are energetically favorable overall. Enzymes can 
accomplish this because they have a three-dimensional specificity that allows them to 
sequester and correctly position all the molecules involved in a series of such reactions. 
RNA catalysts cannot do this. The figure above shows an enzymatically mediated 
reaction called aminoacylation. The diagram shows the specificity of fit between a tRNA 
synthetase and a tRNA molecule during this two-stage chemical reaction. The synthetase 
links the tRNA to a specific amino acid using energy from ATP, thus coupling 
energetically favorable and unfavorable reactions. Amino acids and ATP molecules are 
not pictured. They would be enveloped by the synthetase during the reactions represented 
by the chemical equations. 



Thus, the demonstration that RNA can catalyze “all the elementary reactions” of 
translation, but neither the suite of functions nor the coordinated functions performed by 
the necessary enzyme catalysts of the extant translation system, does little to establish the 



plausibility of ribozyme -based protein synthesis, let alone the transition to enzyme-based 
protein synthesis, that the RNA-world scenario requires. The inability to account for the 
origin of the translation system and genetic code, therefore, remains a formidable barrier 
to the success of the RNA-world hypothesis. 



Problem 4: The RNA World Doesn’t Explain the Origin of Genetic Information 

As I sifted through the primary scientific literature on the RNA-world hypothesis, it did 
not take me long to realize that the hypothesis faced significant problems quite apart from 
the central sequencing problem that most interested me. Yet I also realized that it did not 
resolve the mystery of the origin of biological information — which I had, heretofore, 
called the DNA enigma. Indeed, I now realized that I might just as easily have called that 
mystery the “RNA enigma,” because the information problem looms just as large in a 
hypothetical RNA world as it does in a DNA world. This is not actually surprising. The 
RNA world was proposed not as an explanation for the origin of biological information, 
but as an explanation for the origin of the interdependence of nucleic acids and proteins 
in the cell’s information-processing system. And as I studied the hypothesis more 
carefully, I realized that it presupposed or ignored, rather than explained, the origin of 
sequence specificity — infonnation — in various RNA molecules. 

Consider the step in the RNA-world scenario that I just examined — getting from a 
primitive replicator to a system for building the first proteins. Even if a system of 
ribozymes for building proteins had arisen from an RNA replicator, that system of 
molecules would still need information-rich templates for building specific proteins. 
RNA-world advocates give no account of the origin of that information beyond vague 
appeals to chance. But as I argued in Chapters 8-10, chance is not a plausible explanation 
for the infonnation necessary for building even one protein of modest length, let alone a 
set of RNA templates for building the proteins needed to establish a protein-based 
translation system and genetic code. 

The need to account for these templates of information stands as a fonnidable challenge 
to the RNA world. Nevertheless, the hypothesis faces an even more basic information 
problem: the first self-replicating RNA molecules themselves would have needed to be 
sequence-specific in order to perform the function of replication, which is a prerequisite 
of both natural selection and any further evolution toward cellular complexity. 

Though the RNA world was originally proposed as an explanation for the “chicken and 
egg” functional interdependence problem, not the infonnation problem, some 
RNA-world advocates nevertheless appear to think that it can somehow leapfrog the 
sequence-specificity requirement. They imagine short chains (oligomers) of RNA arising 
by chance on the prebiotic earth. Then, after a sufficiently large pool of these molecules 
had arisen, some would have acquired the ability to self-replicate. In such a scenario the 
capacity to self-replicate would then favor the survival of those RNA molecules that 
could do so and thus would favor the specific sequences that the first self-replicating 




molecules happened to have. Thus, self-replication arose again as kind of “accidental 
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choice remembered.” 

But like Quastler’s DNA-iirst model discussed in the last chapter, this scenario merely 
shifts the specificity problem out of view. First, for strands of RNA to perform catalytic 
functions (including self-replication) they, like proteins, must display specific 
arrangements of their constituent building blocks (nucleotides in the RNA case). In other 
words, not just any sequence of RNA bases will be capable of self-replication. Indeed, 
experimental studies indicate that RNA molecules with the capacity to replicate 
themselves, if they exist at all, are extremely rare among possible RNA base sequences. 
Although no one has yet produced a fully self-replicating RNA molecule, 2 some 
researchers have engineered a molecule that can copy a part of itself — though only about 
10 percent of itself and then only if a complementary primer strand is provided to the 
ribozyme by the investigator. Significantly, the scientists selected this partial 
self-replicator out of an engineered pool of 1,000 trillion (10 15 ) other RNA molecules, 
almost all of which lack even this limited capacity for self-replication. This suggests 
that sequences with this capacity are extremely rare and would be especially so within a 
random (nonengineered) sample. 

Further, for an RNA molecule to self-replicate, the RNA strand must be long enough to 
form a complex structure. Gerald Joyce and the late Leslie Orgel are two prominent 
origin-of-life researchers who have evaluated the RNA- world scenario in detail. They 
consider, for the sake of argument, that a replicase could form in a 50-base RNA strand, 
though they are clearly skeptical that an RNA sequence of this length would really do the 
job. Experimental results have confirmed their skepticism. Jack Szostak, a prominent 
ribozyme engineer, and his colleagues have found that it typically takes at least 100 bases 
to fonn structures capable of catalyzing simple ligation (linking) reactions. He estimates 
that getting a ligase capable of performing the other functions that polymerases must 
perform — “proper template binding, fidelity and strand separation” — may require 
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between 200 and 300 nucleotides. The ribozyme mentioned above — the one that can 
partially copy itself — required 189 nucleotide bases. It is presently unclear how many 
bases would be needed to generate enough structural complexity to allow true polymerase 
function, since no molecule capable of both complete and unassisted self-replication has 
yet been engineered. It may be as low as 189 bases, but it may be much higher, or it may 
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simply be impossible. Moreover, the problem may be more basic than length. RNA, 
with its limited alphabet of four bases, may not even have the capacity to form the 
complex three-dimensional shapes and distributions of charge necessary to perform 
polymerase or replicase function. 

In any case, even if we suppose that RNA-based RNA polymerases (replicases) are 
possible, experimental evidence indicates that they would have to be 
information-rich — both complex and specified — just like modem DNA and proteins. Yet 
explaining how the building blocks of RNA might have arranged themselves into 
information-rich sequences has proven no easier than explaining how the parts of DNA 
might have done so, given the requisite length and specificity of these molecules. As 




Christian de Duve has noted in critique of the RNA-world hypothesis, “Hitching the 
components together in the right manner raises additional problems of such magnitude 
that no one has yet attempted to do so in a prebiotic context.” 34 

Certainly, appeals to chance alone have not solved the RNA infonnation problem. A 
100-base RNA molecule corresponds to a space of possibilities equal to 4 100 (or 10 60 ). A 
200-base RNA molecule corresponds to 4“ 00 (or 10 170 ) possibilities. Given this and the 
experiments mentioned above showing the rarity of functional ribozymes (to say nothing 
of polymerases) within RNA sequence space, the odds of a functional, self-replicating 
RNA sequence arising by chance are exceedingly small. Moreover, the odds against such 
an event occurring are only compounded by the likely presence of destructive 
cross-reactions between desirable and undesirable molecules within any realistic 
prebiotic environment. 

To make matters worse, as Gerald Joyce and Leslie Orgel note, for a single-stranded 
RNA catalyst to produce an RNA identical to itself (i.e., to “self-replicate”), it must find 
an appropriate RNA molecule nearby to function as a template, since a single-stranded 
RNA cannot function as both replicase and template. Moreover, as they observe, this 
RNA template would have to be the precise complement of the replicase. Once this 
chance encounter occurred, the replicase molecule could make a copy of itself by making 
a complement of its complement (i.e., by transcribing the template), using the physics of 
nucleotide base pairing. 3 

This requirement, of course, compounds the infonnational problem facing this crucial 
step in the RNA-world scenario. Even if an RNA sequence could acquire the replicase 
function by chance, it could perform that function only if another RNA molecule — one 
with a highly specific sequence relative to the original — arose close by. (See Fig. 14.6.) 
Thus, in addition to the specificity required to give the first RNA molecule 
self-replicating capability, a second RNA molecule with an extremely specific 
sequence — one with essentially the same specificity as the original — would also have to 
arise. RNA-world theorists do not explain the origin of the requisite specificity in either 
the original molecule or its complement. Orgel and Joyce have calculated that to have a 
reasonable chance of finding two such complementary RNA molecules of a length 
sufficient to perform catalytic functions would require an RNA library of some 10 48 
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RNA molecules. The mass of such a library vastly exceeds the mass of the earth, 
suggesting the extreme implausibility of the chance origin of a primitive replicator 
system. They no doubt vastly underestimate the necessary size of this library and the 
actual improbability of a self-replicating couplet of RNAs arising, because, as noted, they 
assume that a 50-base RNA might be capable of self-replication. (See note for qualifying 
details.) 37 

Given these odds, the chance origin of even a primitive self-replicating system — one 
involving a pair of sequence-specific (i.e., information-rich) replicases — seems extremely 
implausible. And, yet, invoking natural selection doesn’t reduce the odds or help explain 
the origin of the necessary replicators since natural selection ensues only after 




self-replication has arisen. As Orgel and Joyce explain, “Without evolution [i.e., prebiotic 
natural selection] it appears unlikely that a self-replicating ribozyme could arise, but 
without some form of self-replication there is no way to conduct an evolutionary search 
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for the first primitive self-replicating ribozyme.” 

Robert Shapiro has resorted to one of my old standbys — Scrabble letters — to illustrate 
why neither chance, nor chance and natural selection combined, can solve the sequencing 
problem in the RNA world. While speaking in 2007 at a private conference on the origin 
of life, he asked an elite scientific audience to imagine an enormous pile of Scrabble 
letters. Then he said, “If you scooped into that heap [of letters], and you flung them on 
the lawn there, and the letters fell into a line which contained the words, ‘To be or not to 
be, that is the question,’ that is roughly the odds of an RNA molecule, given no feedback 
[natural selection] — and there would be no feedback, because it [the RNA molecule] 
wouldn’t be functional until it attained a certain length and could copy itself — appearing 
on earth.” 39 

If neither chance, nor chance and selection, can solve the RNA sequencing problem, can 
self-organization do the trick? It can’t. RNA bases, like DNA bases, do not manifest 
bonding affinities that can explain their specific arrangements. Thus, no one has even 
attempted to solve the RNA sequencing problem by proposing a “self-organizational 
RNA world scenario.” Instead, the same kind of evidentiary and theoretical problems 
emerge whether one proposes that genetic information arose first in RNA or DNA 
molecules. And every attempt to leapfrog the sequencing problem by starting with 
supposedly “information-generating” RNA replicators has only shifted the problem to the 
specific sequences that would be needed to make such replicators functional. 



Once again the pink stuff was spreading. 
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Figure 14.6. The minimal requirements for template-directed RNA self-replication as 
envisioned by Joyce and Orgel. They insist that any RNA replicase would need to come 
into close proximity to an exact complementary strand, thus increasing the needed 
sequence specificity associated with getting such self-replication (and natural selection) 
started. 



Problem 5: Ribozyme Engineering Does Not Simulate Undirected Chemical 
Evolution 

Because of the difficulties with the RNA-world hypothesis and the limited number of 
enzymatic functions that naturally occurring ribozymes can perform, a new cottage 



industry has sprung up in molecular biology. Scientists sympathetic to the RNA world 
have sought to design new RNA molecules with heretofore unobserved functions. In 
doing so, these scientists hoped not only to learn more about RNA chemistry, but also to 
demonstrate the plausibility of the RNA-world hypothesis and possibly even to 
synthesize an artificial form of life . 40 

These ribozyme-engineering experiments typically deploy one of two approaches: the 
“rational design” approach or the “directed evolution” approach. In both approaches, 
biologists try to generate either more efficient versions of existing ribozymes or 
altogether new ribozymes capable of performing some of the other functions of proteins. 
In the rational-design approach, the chemists do this by directly modifying the sequences 
of naturally occurring RNA catalysts. In the directed-evolution (or “irrational design”) 
approach, scientists seek to simulate a form of prebiotic natural selection in the process of 
producing ribozymes with enhanced functional capacities. To manage this they screen 
pools of RNA molecules using chemical traps to isolate molecules that perform particular 
functions. After they have selected these molecules out of the pool, they generate variant 
versions of these molecules by randomly altering (mutating) some part of the sequence of 
the original molecule. Then they select the most functional molecules in this new crop 
and repeat the process several times until a discernible increase in the desired function 
has been produced. 

Most ribozyme-engineering procedures have been performed on ligases, ribozymes that 
can link together two RNA chains (oligomers) by forming a single (phosphodiester) bond 
between them. Ribozyme engineers want to demonstrate that these ligases can be 
transformed into true polymerases or “replicases.” These polymerases would not only 
link nucleotides bases together (by phosphodiester bonds), but also would stabilize the 
exposed template strands, and use the exposed bases as a template to make 
sequence-specific copies. 

Polymerases are the holy grail of ribozyme engineering. According to the RNA-world 
hypothesis, once a polymerase capable of template-directed self-replication arose, then 
natural selection could have become a factor in the subsequent chemical evolution of life. 
Since ligases can perform one, though only one, of the several functions perfonned by 
true polymerases, RNA-world theorists have postulated ligases as the ancestral molecular 
species from which the first self-replicating polymerase arose. They have tried to 
demonstrate the plausibility of this conjecture by using ribozyme engineering to build 
polymerases (or replicases) from simpler ligase ribozymes. 

To date, no one has succeeded in engineering a fully functional RNA-based RNA 
polymerase, from either a ligase or anything else . 41 Ribozyme engineers have, however, 
used directed evolution to enhance the function of some common types of ligases. As 
noted, they also have produced a molecule that can copy a small portion of itself. Leading 
ribozyme engineers such as Jack Szostak and David Bartel have presented these results as 
support for an undirected process of chemical evolution starting in an RNA world . 42 
Popular scientific publications and textbooks have often heralded these experiments as 




models for understanding the origin of life on earth and as the leading edge of research 
establishing the possibility of evolving an artificial form of life in a test tube. 

Yet these claims have an obvious flaw. Ribozyme engineers tend to overlook the role that 
their own intelligence has played in enhancing the functional capacities of their RNA 
catalysts. The way the engineers use their intelligence to assist the process of directed 
evolution would have no parallel in a prebiotic setting, at least one in which only 
undirected processes drove chemical evolution forward. Yet this is the very setting that 
ribozyme experiments are supposed to simulate. 

RNA-world advocates envision ligases evolving via undirected processes into RNA 
polymerases that can replicate themselves from free-standing bases, thereby establishing 
the conditions for the beginning of natural selection. In other words, these experiments 
attempt to simulate a transition that, according to the RNA-world hypothesis, would have 
taken place before natural selection had begun to operate. Yet in order to improve the 
function of the ligase molecules, the experiments actually simulate what natural selection 
does. Starting from a pool of random sequences, the investigators create a chemical trap 
to isolate only those sequences that evince ligase function. Then they select those 
sequences for further evolution. Next they use a mutagenesis technique to generate a set 
of variant versions of these original ligases. Then they isolate and select the best 
sequences — those manifesting evidence of enhanced ligase function or indications of 
future polymerase function — and repeat the process until some improvement in the 
desired function has been realized. 

But what could have accomplished these tasks before the first replicator molecule had 
evolved? Szostak and his colleagues do not say. They certainly cannot say that natural 
selection played this role, since the origin of natural selection as a process depends on the 
prior origin of the self-replicating molecule that Szostak and his colleagues are working 
so hard to design. Instead, in their experiment, Szostak and his colleagues play a role that 
nature cannot play until a self-replicating system, or at least a self-replicating molecule, 
has arisen. Szostak and his colleagues function as the replicators. They generate the crop 
of variant sequences. They make the choices about which of these sequences will survive 
to undergo another round of directed evolution. Moreover, they make these choices with 
the benefit of a foresight that neither natural selection nor any other undirected or 
unintelligent process can — by definition — possess . 43 Indeed, the features of the RNA 
molecules that Szostak and his colleagues isolate and select are not features that would, 
by themselves in a precellular context, confer any functional advantage. 

Of course, ligase enzymes perfonn functions in the context of modem cells and in that 
setting might confer a selectable advantage on the cells that possess them. But prior to the 
origin of the first self-replicating protocell, ligase ribozymes would not have any 
functional advantage over any other RNAs. At that stage in chemical evolution, no 
self-reproducing system yet existed upon which any advantage could be conferred. 




The ability to link (ligate) nucleotide chains is, at best, a necessary but not a sufficient 
condition of polymerase or replicase function. Absent a molecule or, what is more likely, 
a system of molecules possessing all of the features required for self-replication, nature 
would not favor any RNA molecule over any other. Natural selection as a process selects 
only functionally advantageous features and only in self-replicating systems. It passes its 
blind eye over molecules possessing merely necessary conditions or possible indicators 
of future function. Moreover, “it” does nothing at all when mechanisms for replication 
and selection do not yet even exist. In ribozyme-engineering experiments, engineers 
perform a role in simulating natural selection that undirected natural processes cannot 
play prior to the commencement of natural selection. Thus, even if ribozyme experiments 
succeed in significantly enhancing the capacities of RNA catalysts, it does not follow 
they will have demonstrated the plausibility of an undirected process of chemical 
evolution. Insofar as ribozyme-engineering experiments using a rational-design approach 
(as opposed to a directed-evolution approach) involve an even more overt role for 
intelligence, they exemplify the same problem (for example, see note 28). 



Conclusion 

As I have investigated various models that combined chance and necessity, I have noted 
an increasing sense of futility and frustration arising among the scientists who work on 
the origin of life. As I surveyed the literature, it became clear that this frustration had 
been building for many years. In 1980, Francis Crick lamented, “An honest man, anned 
with all the knowledge available to us now, could only state that in some sense, the origin 
of life appears at the moment to be almost a miracle, so many are the conditions which 
would have had to have been satisfied to get it going.” 44 In 1988, the German biochemist 
and origin-of-life researcher Klaus Dose followed suit with an equally critical assessment 
of the state of the field. Dose explained that research efforts to date had “led to a better 
perception of the immensity of the problem of the origin of life on earth rather than to its 
solution. At present, all discussions on principle theories and experiments in the field 
either end in a stalemate or a confession of ignorance.” 45 After attending a scientific 
conference on the origin of life in 1989, one of my Cambridge supervisors returned to 
report, “The field is becoming increasingly populated with cranks. Everyone knows 
everybody else’s theory doesn’t work, but no one is willing to admit it about his own.” 

As I reviewed and evaluated the scientific literature over the ensuing years, I found no 
reason to amend these assessments. It was demonstrably more reasonable to reject the 
chance hypothesis than to accept it. Theories relying on necessity awaited the discovery 
of an oxymoron, namely, “a law capable of producing information” — a regularity that 
could generate specified irregularity. Meanwhile, theories combining law and chance 
repeatedly begged the question as to the origin of the information they sought to explain. 
Theorists just spread pink stuff from one place to another, hoping in vain to make it 
disappear. 



Moreover, framing the possibilities differently didn’t change the situation. Protein-first 
theories had failed, giving way to DNA-first theories. DNA-first theories then failed, 




giving way to RNA-fi rst theories. And now RNA-lirst theories, like their predecessors, 
had failed to explain the central question of the origin of the information that living cells 
require. As Orgel and Joyce concluded in 1993, “The de novo appearance of 
oligonucleotides [i.e., specifically sequenced RNA bases] on the primitive earth” would 
have been “a near miracle.” 46 

But origin-of-life researchers did not give up. Instead, they had increasingly taken matters 
into their own hands. As it became more difficult to envision how life could have arisen 
from the molecular constituents of actual living cells, some evolutionary biologists 
decided to try to simulate the chemical evolutionary processes on a computer. If 
explaining how infonnation had first arisen in DNA, RNA, and proteins had proven 
prohibitively difficult, then, perhaps scientists could at least program computers to show 
how chance and necessity alone might have done the job in some other hypothetical 
context. Meanwhile, RNA chemists, frustrated by the embarrassing paucity of proteinlike 
functions associated with the new miracle molecule, decided that it was time to do a little 
engineering of their own — “ribozyme engineering” to be precise. Ironically, they sought 
to demonstrate the power of chance and necessity by intervening — by carefully isolating, 
selecting, and amplifying those specific features of RNA that could conceivably enhance 
its capacities. 

But as I examined these new approaches, 47 I found them no more convincing than those 
they were seeking to supplement. Even apart from their limited success, the very fact that 
these experiments required so much intervention seemed significant. By involving 
“programming” and “engineering” in simulations of the origin of life, these new 
approaches had introduced an elephant into the room that no one wanted to talk about, 
especially not in the methods sections of scientific papers. I began to reflect on the failure 
of these simulations to explain the DNA enigma apart from the guidance of intelligent 
scientists. I wondered if they hadn’t inadvertently provided evidence for a radically 
different approach to the problem of biological information. This led me back to where I 
had started — to the idea of intelligent design and to a consideration of the scientific case 
in its favor — a case that now I knew I could make. 
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The Best Explanation 



My family enjoys a detective show called Monk, featuring an obsessive-compulsive 
detective with almost preternatural powers of observation. Near the end of most episodes, 
there comes a point where Mr. Monk announces, “I just solved the case. Here’s what 
happened.” It’s the moment when, in a flash of insight, he at last sees how all the pieces 
fit together, how all the clues can be explained. The various competing explanations fall 
away, inadequate to the task, and the one true explanation leaps forth. 

When I was studying in Cambridge, I was repeatedly struck by the similarity between the 
historical method of scientific investigation and the method used by sleuthing detectives, 
both real and fictional. While learning about how historical scientists use abductive 
reasoning, I even found a book by the Italian scholar Gian Capretti, who used the 
fictional Sherlock Holmes detective stories to illustrate how this form of reasoning 
works . 1 

Historical scientists and detectives alike must gather clues, study each one, and then form 
tentative hypotheses. With detectives, suspects are identified; with historical scientists, 
possible causes. Both sleuths and scientists must then weigh the competing possibilities, 
favored and otherwise, judging their plausibility against a variety of clues and against 
their own knowledge of how the world works. 



Did the murder suspect have the power to strangle the victim? 
Is the proposed cause known to be adequate? 

Did the suspect have opportunity? 

Is there evidence that the cause was in fact presen t? 

Which murder scenario best explains the relevant facts? 

Which hypothesis best explains the evidence? 



Now, historical scientists, like criminal investigators, are acutely aware that they were not 
at the “scene of the crime.” They did not witness the event or its cause. Instead, they must 
depend upon circumstantial evidence, the lingering indicators of what happened: the 
clues, the traces, the signs left behind. 

As I first set out to investigate the DNA enigma I was intrigued by an outside-the-box 
hypothesis that scientists either had long since set aside or now considered verboten. 
Though the concept of design played a seminal role in the foundation of modern science, 
no one in late-twentieth-century science still regarded intelligent design as a live option 
for explaining the origin of life. Or so I had assumed. But my encounters with Charles 



Thaxton, Dean Kenyon, and others introduced me to a body of evidence that seemed — at 
least — to point toward this radical possibility. 



From everyday experience and from what I would soon learn about historical scientific 
reasoning, I knew that circumstantial evidence could be inconclusive. The same effect 
can sometimes be produced by more than one cause and, consequently, can be explained 
in several different ways. As philosopher Charles Sanders Peirce explained, abductive 
reasoning “often inclines our judgment so slightly toward its conclusion that we cannot 
say that we believe [the conclusion] to be true; we only sunnise that it may be so.” 

To address this uncertainty and to prevent premature leaps to judgment, historical 

scientists developed a method known as the “method of multiple competing hypotheses,” 

in which they seek to infer the best causal explanation of the evidence in question. As 

noted in Chapter 7, this method requires a thorough search for and evaluation of various 

possible causes, leaving — in the best of cases — only one that can explain the relevant 

facts. Historical scientists proceed in the same manner as the fictional Sherlock Holmes, 
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who, according to Capretti, uses a method of “progressively eliminating hypotheses.” 

As I investigated the DNA enigma, I consciously followed this method of reasoning. 
Specifically, I followed and tested my ideas in light of a version of the method that 
philosopher of science Michael Scriven calls “retrospective causal analysis.” Scriven’s 
method distills the key insights of Darwin and Lyell about how to identify past causes 
and about when a specifically historical scientific explanation counted as the “best.” 

Though inferences about the past can be initially fraught with uncertainty, there 
nevertheless does comes a time in many historical and forensic investigations when the 
investigator realizes that one hypothesis clearly explains the facts better than any other. 
And so it was in my case. Yet unlike Mr. Monk, the television detective with a savantlike 
intellect, I had no single “ah-hah” moment where I suddenly said, “I know what 
happened.” Instead, as I learned more about the DNA enigma, I had a series of insights: 
about the nature of information; about why the information in DNA transcends physics 
and chemistry; about why it exceeds the reach of chance; about the real take-home 
message of various simulation experiments; and about the criteria that make historical 
explanations best. As I used the historical scientific method to evaluate the evidence and 
competing explanations in light of these criteria, I eventually came to realize that the 
design hypothesis met each of them. One by one the clues fell into place, the list of 
suspects dwindled, and the pieces fit together. 



Characterizing the Effect 

In my research I learned that before historical scientists can evaluate competing 
explanations of an effect, event, or body of evidence, they usually need to define 
precisely what it is that needs to be explained. 4 Indeed, part of the historical scientific 
method of reasoning involves first characterizing what philosophers of science call the 




explanandum — the entity in need of explanation. 5 It’s no different for origin-of-life 
researchers. They need to clearly define what it is they are trying to explain the “origin 
of.” 6 

For this reason, the cell and its contents provide perhaps the most important clues about 
what happened to cause life to arise. Contemporary biology has shown that cells are a 
repository of information, and that the information they contain is essential for even 
minimal biological function. Consequently, origin-of-life researchers have realized that if 
they were to explain the origin of the first life, they would need to explain how the 
information in DNA or some other alternative molecular precursor arose. 

But what kind of infonnation does DNA possess? This was an important question to 
answer, because the term “infonnation” denotes at least two distinct concepts: Shannon 
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information, on the one hand, and functionally specified infonnation, on the other. As I 
discussed in Chapter 4, Shannon’s equations allow us to compute the 
information-carrying capacity of a sequence; they do not determine whether the sequence 
is meaningful or functional. 

DNA and other biological molecules do have large and measurable amounts 
information-carrying capacity. But they do not contain just Shannon information; they 
contain functional information. In virtue of their specific arrangements, the bases in 
coding regions of DNA and RNA and the amino acids in proteins enable these molecules 
to perform biological functions. Like the information in machine code or written 
language, biological infonnation is not just complex; it is also functionally specified. 8 

Thus, to avoid confusion and equivocation, I realized that it was necessary to distinguish: 



“information content” from mere “information-carrying capacity,” 
“specified information” from mere “Shannon infonnation,” and 
“specified complexity” from mere “complexity.” 



The first term in each of these pairs refers to the functional kind of information that DNA 
possesses. That was the kind of information I needed to explain the “origin of.” 



Assessing the Competing Possible Causes 

Having defined the effect in question as specified biological infonnation, I was now in a 
position to consider the competing explanations of this feature of life to see which 
explanation, if any, was best. Peter Lipton’s seminal Inference to the Best Explanation 
had shown me that the best explanation of an event — whether in the past or in the 
present — was one that cited a causal difference. 9 I had discovered, further, that historical 
scientists understandably preferred explanations that posited causes that were known to 
be capable of producing the effects in question over explanations that posited either no 
causes or causes lacking such power. Both an earthquake and a bomb can explain the 




